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structures  from  word  candidates  found  in  all  parts  of  the  input 
utterance.  These  local  structures  are  used  as  "islands  of  relia¬ 
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A  LOCALLY  OHGAN1ZED  PARSER  FOR  SPOKEN  INPUT 


! .  INTRODUCTION 


A.  BRIEF  OVERVIEW 

Recognizing  a  spoken  utterance  by  machine  is  a  task  much  different  from  that  of  recognizing 
a  text  sentence.  The  difference  crises  both  from  the  difficulty  of  front-end  phoneme  recognition 
and  from  the  articulatory  variability  of  speech.  As  a  result,  phonetic  segments  often  cannot  be 
identified  uniquely  with  any  real  accuracy  and,  therefoie,  an  utterance  processed  by  a  phoneme 
recognizer  can  contain  a  great  deal  of  local  ambiguity  and  error.  In  practical  terms,  this  means 
that  the  words  are  not  recognizable  immediately,  as  they  are  in  text. 

Humans  use  a  great  deal  of  syntactic  and  semantic  information  to  help  them  handle  this 
ambiguity.  They  possess  much  knowleuge  of  the  things  they  talk  about,  and  of  how  words  can  be 
put  together  to  describe  these  things.  Most  previous  work  in  machine  speech  recognition,  how¬ 
ever,  has  dealt  primarily  with  acoustical  processing,  such  as  phoneme  recognition,  pitch  and 
intensity  analysis,  etc.,  with  little  attempt  to  integrate  syntactic  and  semantic  knowledge  into 
this  analysis.  Any  work  on  the  recognition  of  continuous  speech  which  relies  soi^’y  nn  such  a 
low- level  approach  is  a*  the  mercy  of  the  inherent  local  ambiguity 

The  present  researen  demonstrates  how  a  greater  degree  of  language  competence  can  be 
integrated  into  this  recognition  process. 

This  report  develops  an  approach  which  uses  semantic  and  syntactic  information  to  assist 
machine  recognition  of  speech.  The  approach  is  implemented  in  LPARS,  a  Locally  organized 
PARSer  designed  to  process  continuous  speech  with  the  help  of  syntactic  and  semantic  informa¬ 
tion.  Its  approach  differs  from  traditional  parsing  methods  in  that  it  has  no  inherent left-to-right, 
or  right-to-left,  bias  to  its  operation.  Rather,  it  allows  syntactic  structures  to  be  recognized 
locally  ia  any  part  of  the  sentence.  In  fact,  several  parse  structures  may  be  built  v.p  simulta¬ 
neously  ill  different  parts  cf  the  sentence  and  later  connected  by  searching  for  words  that  might 
reasonably  exist  between  them. 

The  approach  ir  highly  data-directed.  letting  previously  recognized  structures  help  guide 
the  search  for  other  words  in  the  utterance.  Both  syntactic  and  semantic  constraints  are  used 
to  help  guide  the  analysis.  Thu3,  the  system  is  a  "vertically  organized"  one  in  the  sense 
that  it  allows  syntax  and  semantics  to  give  feedback  to,  and  help  guide,  the  lower-level  word- 
recognition  process. 

The  emphasis  in  the  present  work  has  been  to  explore  the  various  issues  involved  in  the  de¬ 
sign  of  a  local  parsing  system,  and  then  to  demonstrate  that  such  a  system  can  be  created.  Thus 
empi  asis  has  not  been  on  developing  a  finely  tuned  system  with  high  performance  characteristics. 

The  remainder  of  this  introductory  section  gives  an  overview  of  the  approach  taken.  Sec¬ 
tions  B  ana  C  describe  in  more  detail  the  sources  and  nature  of  local  ambiguity  and  error  in 
speech.  In  See.  D,  two  basic  concepts  are  presented:  that  of  phonetic  distance,  and  that  of  a 
vertically  organized  system.  These  two  basic  concepts  are  central  to  understanding  the  approach 
described.  Then  in  Sec.E,  a  simplified  example  is  worked  through  to  give  a  concrete  picture 
of  how  the  approach  is  implemented.  Thi3  Introduction  concludes  by  discussing  related  work 
(Sec.F)  and  by  outlining  how  the  presentation  of  the  approach  is  developed  in  the  rest  of  this  report. 
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B.  LOCAL  AMBIGUITY  AND  ERROR  IN  SPEECH 


In  the  early  1960's,  researrners  working  on  speech  recognition  approached  the  p - iem 

from  the  local  standpoint  of  recognizing  sound  segments  in  the  utterance  in  much  the  sane  spirit 
t’S  one  might  try  to  recognise  letters  on  a  printed  page.  At  that  time,  most  work  concentrated 
on  the  recognition  of  isolated  words  and  phrases;  thus,  no  extensive  use  of  syntactic  end  se¬ 
mantic  constraints  between  different  parts  of  the  utterance  was  possible.  In  fact,  it  was  initially 
hoped  that  phoneme  recognition  could  be  developed  to  sufficient  accuracy  that~the  parsing  of  con¬ 
tinuous  speech  could  be  accomplished  in  much  the  same  fashion  as  the  parsing  of  text.  ’ 

This  approach  was  not  very  profitabK  .  It  was  difficult  to  develop  algorithms  which  could 
even  approach  the  desired  level  of  accuracy.  Furthermcre,  it  became  apparent  that  the  difficulty 
was  not  entirely  the  fault  of  the  algorithms  used,  but  rather  the  result  of  local  ambiguity  inherent 
in  speech  itself. 

There  are  two  fairly  distinct  but  related  sources  of  local  ambiguity  and  error  in  speech: 

(1)  articulatory  variation,  and  (2)  machine  recognition  limitations.  It  is  worthwhile  to  discuss 
both  briefly. 

1 .  Articulatory  Variation 

A  sequence  of  spoken  phonemes  differs  from  a  sequence  of  letters  written  on  a  page,  because 
the  articulation  of  phonemes  is  a  dynamic  series  of  events  which  takes  place  as  the  vocal  appa¬ 
ratus  changes  from  cne  configuration  to  another.  Each  Dhoneme  can  be  thought  of  as  an  idealized 
representation  of  a  target  configuration  which  the  vocal  apparatus  attempts  to  attain. 

The  exact  manner  and  the  degree  to  which  the  target  configurations  are  attained  may  vary 
in  different  attempts  to  vocalize  a  singie  utterance.  Thus,  the  underlying  phenomenon  of  speech 
itself  is  not  constant  or  consistent;  it  is  a  dynamic  phenomenon  which  is  not  reliably  reproduc¬ 
ible.  Furthermore,  the  acoustic  cues  which  might,  if  present,  uniquely  identify  the  phonetic 
segments  «re  often  absent  or  barely  noticeable. 

2.  Machine  Recognition  ’  '  nitations 

Gra  iting  the  inherently  ambiguous  character  of  speech  itself,  it  is  riot  surprising  that  the 
problem  of  algorithmically  segmenting  and  classifying  spoken  phonetic  segments  is  a  difficult 
one.  A  series  of  dynamic  phenomena  must  be  processed  using  recognition  criteria  which  statis¬ 
tically  are  as  accurate  as  possible.  Since  the  cues  which  uniquely  distinguish  certain  phonemes 
can  be  quite  subtle,  it  is  difficult  to  write  an  algorithm  to  pick  up  these  cues  only  when  they  are 
"really"  present.  Despite  a  number  of  different  attempts  to  develop  phoneme  recognizers,  the 
amount  of  ambiguity  and  error  which  characterizec  existing  phoneme  recognizers  is  quite  large. 

G.  PRACTICAL  IMPLICATIONS  OF  LOCAL  AMBIGUITY 

In  practice,  the  local  ambiguity  described  above  implies  that  w'ords  are  not.  easily  recogniz¬ 
able.  In  this  respect,  it  is  useful  to  make  a  somewhat  arbitrary  distinction  and  identify  two 
separate  problems; 

(1)  Small  tressed  "funct:  >n"  words. 

(2)  Longer  "content"  words  which  are  ccnsidcrably  garbled  (by  excessive  error  and 
ambiguity). 
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1.  Small  Unstressed  Function  Words 

Small  unstressed  words  such  as  "a",  "the",  "and",  "of,  "is",  etc.  tend  not  to  be 
articulated  clearly  and  are  sometimes  barely  articulated  at  all.  For  example,  the  phrase  "Uni¬ 
versity  of  Michigan"  can  be  spoken  qu'te  intelligibly  without,  or  with  just  a  hint  of,  the  word  "of". 
This  pose3  no  problem  to  a  human  listener  because  semantic  and  syntactic  knowledge  primeshirr. 
to  expect  the  word  "of"  to  be  present. 

The  recognition  of  such  words  clearly  poses  difficulties  for  any  speech  recognizer  which 
does  not  use  context  to  indicate  where  such  words  might  be  present.  Indeed,  even  if  the  word 
is  well  articulated,  »;s  small  size,  in  the  presence  of  front-end  error,  mignt  still  make  it  diffi¬ 
cult  to  find  rerably  without  contextual  cues. 

2  Longer  Words  Which  Are  Garbled 

Longer  "content"  word?  offer  more  phonetic  information  to  match  against  the  inpi>*.  Also, 
such  words  contain  stressed  syllables,  which  tend  to  be  articulated  more  clearly  than  unstressed 
syllables.  If  the  system  is  to  work  at  all,  such  long  words  should  be  fairly  reliably  recognizable. 
Frequently,  however,  some  longer  words  may  be  quite  garbled  by  the  combination  of  articulatory 
variability  and  front-end  error.  Any  attempt  to  distinguish  all  such  words  on  the  first  scan 
through  the  senten~ ;  would  be  liable  to  deluge  the  system  with  an  overwhelming  number  of  word 
candidates,  most  of  them  wrong. 

It  is  therefor  much  more  reasonable  to  look  for  such  words  very  selectively,  only  when 
one  ha3  built  up  i  gr_-st  deal  of  contextual  information  indicating  the  exact  section  of  input  where 
such  a  word  might  »'  dst.  and  indicating  a  small  list  of  word  possibilities  that  might  meaningfully 
be  present. 

Ii  is  clear  that  humans  deal  with  the  ambiguity  described  above  because  they  possess  a  great 
deal  of  knowledge  of  the  things  (objects,  events,  actions,  etc.)  being  talked  about,  and  of  how 
utterances  are  formulated  in  English  to  describe  these  things.  For  instance,  if  a  speaker  pauses 
in  the  middle  of  a  sentence,  a  listener-  can  often  predict  that  one  of  a  limited  number  of  words 
is  likely  to  be  spoken  next,  Certainly  the  listener  can  reasonably  discard  as  possibilities  most 
of  the  words  in  his  vocabulary. 

The  present  work  explores  how  such  knowledge  can  be  incorporated  into  a  machine  so  that 
it  can  be  used  in  a  systematic  fashion. 

?.  Tradeoffs  in  a  Speech  Recognition  System 

An  interesting  issue  related  to  this  problem  concerns  the  tradeoffs  possible  between  in¬ 
creased  refinement  in  the  local  processing  of  an  utterance  (i.e.,  refinement  of  phoneme  recog¬ 
nition  algorithms)  on  the  one  hand,  and  the  use  of  more  global  information  on  the  other.  In  other 
words,  how  much  real  information  does  increased  local  refinement  yield  when  seen  in  terms  of 
the  system  as  a  whole?  Cine  would  suspect  that  intelligent  heuristics  in  the  higher  levels  of  a 
system  might  make  a  great  deal  of  lower-level  refinement  unnecessary.  Other  parameters  in 
this  tradeoff  are  vocabulary  size,  syntactic  and  semantic  complexity,  me  degree  to  which  small 
words  are  Allowed  to  be  used,  and  the  degree  to  which  the  vocabulary  is  partitionable  so  that  at 
different  stages  of  discourse  only  part  of  the  total  vocabulary  is  likely  to  be  used.  This  report 
takes  an  initial  step  toward  exploring  some  of  these  tradeoffs. 
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D,  TWO  BASIC  CONCEPTS 

Some  aspects  of  the  system  presented  here  may  at  first  seem  somewhat  counter-intuitive. 
This  section  outlines  two  concepts  central  to  the  approach:  that  of  a  vertical  system  (one  with 
vertically  orgam  .'d  feedback),  and  that  of  phonetic  distance.  It  is  our  hope  that  an  understand¬ 
ing  of  these  two  concepts  will  help  to  put  the  approach  into  sharper  perspective. 

i.  Vertically  Organized  System 

•LFARS  is  a  vertically  organized  system.  Vertical  system  organization  is  useful  where  a 
great  deal  of  local  ambiguity  exists.  A  vertical  system  is  one  which  allows  tne  various  levels 
of  analysis  to  interact  with  one  another.  In  the  case  of  speech  recognition,  the  various  levels 
are  semantic,  syntactic,  and  lexical  analysis  (and  perhaps  phonemic  analysis).  Figure  1-1  il¬ 
lustrates  the  structure  of  such  a  system. 

I  SEMANTICS 


WORD  RECOGNITION 

- _r - 

i 

Input 

Fig.  1-1.  A  vertical  system. 

A  vertical  system  is  so  named  to  distinguish  it  from  a  "strictly  hierarchical"  system  which 
does  not  have  the  feedback  from  higher-  to  lower-level  analysis.  An  example  of  a  strictly  hier¬ 
archical  system  is  a  programming  language  translator,  typically  having  a  lexical  analyzer  which 
recognizes  identifiers,  numbers,  operators,  etc.  and  passes  them  on  to  the  parser  which  deduces 
their  syntactic  structure  and  turns  this  over  to  the  semantic  analyzer  which  generates  machine 
instructions.  Such  a  system  is  not  a  vertical  one  because,  for  instance,  the  parser  cannot  tell 
the  lexical  analyser  what  tokens  it  might  look  for  next. 

Section  I-F-4  discusses  two  other  examples  of  vertically  organized  systems:  the  M.l.T.  Al 
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vision  system,  and  T.  A.  Winograd's  natural  language  parser.  Vertical  organization  >n  the 

vision  system  allows  higher-level  analysis  to  be  invoked  even  though  3omc  lines  of  a  scene  may 
not  have  been  recognized.  Similarly,  in  speech  analysis,  all  words  need  not  be  recognized  im¬ 
mediately  by  the  lexical  analyzer.  Contextual  information  can  be  used  to  help  determine  bound¬ 
aries  in  unclear  sections  of  the  utterance,  and  to  propose  words  that  might  occur  there. 

Thus,  in  LPARS,  the  lexical  analyzer  first  makes  local  tests  for  words  which  can  be  recog¬ 
nized  fairly  reliably.  This  processing  results  in  a  list  of  "word  candidates,"  some  of  which  may 
be  wrong.  The  word  candidates  are  turned  over  to  the  higher-level  routines  which  direct  tests 
to  discover  additional  words.  Fundamental  to  this  analysis  is  the  ability  to  parse  outward  from 
a  number  of  recognized  portions  of  a  sentence  in  an  organized  manner. 

2.  Phonetic  Distance 

The  concept  of  phonetic  distance  goes  hand  in  glove  with  that  of  a  vertically  organized  system. 
Phonetic  distance  is  used  when  matching  the  phonetic  spelling  of  vocabulary  words  in  the  diction¬ 
ary  against  strings  of  phonemes  recognized  by  the  front  end  of  the  system.  As  an  example,  one 
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simple  meas..-e  of  phonetic  distance  might  be  to  define  the  distance  of  a  word  from  a  section  of 
input  as  equal  to  the  number  of  phonemes  that  differ.  Thus,  FROM  would  be  a  distance  of  1 
from  both  FROL  and  TROM. 

In  practice,  of  course,  some  differences  are  greater  than  others.  The  voicing  difference 
between  3  and  P,  for  instance,  is  presumably  not  as  great  as  tne  difference  between  B  and  S. 
Consequently,  it  is  useful  to  ailow  such  phonetic  distances  wider  range  for  more  accurate  dis¬ 
crimination.  The  concept  of  phonetic  distance  must  airo  be  extended  to  account  for  deleted  and 
inserted  phonemes. 

It  is  not  immediately  obvious  how  such  distances  a.-e  best  measured.  To  make  such  meas¬ 
urements  certainly  requires  a  statistical  evaluation  of  the  particular  front  end  being  used.  These 
issues  are  discussed  more  fully  in  Sec.  III.  Hopefully,  it  will  become  clear  that  precisely  how 
such  phonetic  distance  is  measured  is  not  central  to  the  ideas  presented  in  this  report  (although 
the  effect  of  these  measures  is  definitely  central). 

The  concept  of  phonetic  distance  fits  very  naturally  into  a  vertical  system.  In  a  vertical 
speech  system,  initial  scans  through  the  input  can  be  made  at  low  phonetic  distance  to  find  words 
that  are  not  too  garbled.  Higher  phonetic  distance  tests  for  more  garbled  words  can  be  maue 
selectively,  based  on  semantic  and  syntactic  information. 

E.  SIMPLIFIED  EXAMPLE  OF  LPARS  IN  OPERATION 

LPARS  is  designed  to  process  spoken  input  by  building  up  parse  structures  in  any  part  of  a 
sentence,  and  then  by  using  these  structures  to  help  guide  the  search  for  further  words  to  com¬ 
pete  the  sentence.  It  differs  from  most  parsers  in  the  local  nature  of  its  syntactic  processing 
and  in  its  highly  vertical  structure. 

This  section  gives  a  brief  introductory  description  of  LPARS  itself  and  of  how  it  performs 
lexical,  syntactic,  and  semantic  processing.  Then,  a  simplified  example  of  LPARS  in  operation 
is  worked  through  to  help  put  into  focus  the  various  parts  of  the  system. 

1.  Some  Particulars 

LPARS'  vocabulary  contains  approximately  70  words.  It  recognizes  a  very  restricted,  but 
linguistically  interesting,  subset  of  English.  Its  semantics  are  defined  in  terms  of  a  particular 
scene  (a  small  two-room  house  containing  people,  furniture,  fixtures,  etc.)  about  which  one  may 
make  statements,  ask  questions,  tell  a  very  simple-minded  story,  or  command  the  system  to 
manipulate  the  scene. 

2.  Operation  of  LPARS 

A  3  input,  LPARS  expects  a  siring  of  phoneme  candidates  from  a  front-end  phoneme  recog¬ 
nize  r.  For  the  present  work,  the  input  was  prepared  by  a  phonetic  scrambler  program  which 
simulated  front-end  behavior,  rather  than  by  a  real  phoneme  recognizer.  The  input  phoneme 
candidates  may  be  ambiguous  (i.e.,  several  possibilities  may  be  given  for  one  segment).  The 
inpit  is  also  expected  to  contain  a  fair  amount  cf  error. 

L  PARS  operates  by  tirst  making  an  initial  scan  through  the  sentence,  seeking  longer  words 
which  are  not  too  garbled.  The  scan  returns  a  list  of  word  candidates  which  match  within  a  given 
erro-  tolerance  to  specified  sections  of  the  input.  It  is  likely  that  som:  of  these  word  candidates 
are  mcoi  rect,  and,  indeed,  some  may  overlap. 
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INPl't  SENTENCE: 


•THE -LARGE  COFFEETAELE  SUPPORTS  THE  GREEN  DICTIONARY* 


(1)  INITIAL  SCAN:  COFFEE  TABU,  GREEN 


tf)  JDCAL  HIGHER  DISTANCE  SCANS: 


STf.CH 


CO.  --ELT;.BU 
(od>)  (prep) 

(del)  (verb) 

LARGE  NIL 

(ooj) 

(del) 

THE 


GREEN  ENDCH 

(odj)  (noon) 

(det)  DICTIONARY 

THE 
(verb) 

(Prep) 


.3)  PARTIAL  PARSE  TREE  CONSTRUCTION: 


the  lorge  cofleetnble 


‘He  green  dictionory 


(4)  CONNECTION  OF  PARTIAL  PARSE  TREES: 


NP  A 

A 

(5)  RECOGNIZED  SENTENCE: 


A  I  A 

supports 


Fig.  1-2,  Simple  example  o*  FARS  in  operation. 
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These  word  candidates  are  turned  over  to  the  higher-level  part  of  the  system  which  initiates 
scans  for  small  words  and  for  more  highly  jled  words  in  the  areas  adjacent  to  and  between 
the  words  found  in  the  initial  scan,  and  groups  the  words  together  into  parse  structures.  This 
processing  is  done  systematically  in  an  a'tempt  to  uncover  the  entire  utterance. 

3.  Simple  Example 

Figure  1-2  gives  a  simplified  example  of  LPARS  in  operation.  The  input  utterance  being 
processed,  "The  large  coffeetable  supports  the  green  dictionary"  is  spoken,  analyzed  by  a  front- 
end  phoneme  recognizer,  and  a  string  of  phoneme  candidates  is  produced  for  input  to  LPARS. 

LPARS  fir.^t  makes  an  initial  scan  through  the  sentence  at  a  low  phonetic  distance  looking 
for  longer  words  which  are  not  too  garbled.  Let  us  assume  that  in  this  instance  the  two  word 
candidates  "coffer  ible*  and  "green"  are  found,  These  word  candidates  are  turned  over  to  the 
high-levei  part  of  the  system,  together  with  a  start-of-sentence  character  (STRCH)  and  an  end- 
of-sentence  character  (ENDCH)  which  the  system  adds. 

The  higher-level  analysis  consists  of  three  fairly  distinct  stages,  as  Fig.  1-2  indicates. 

First,  scans  at  higher  phonetic  distance  are  made  based  on  fairly  local  cues,  in  the  areas  sur- 
-ounding  the  word  candidates.  In  this  example,  these  initial  higher-distance  scans  are  very 
simple.  A  scan  in  front  of  the  noun  uncovers  tne  adjective  "large",  and  a  further  scan  in  front 
of  that  word  uncovers  the  determiner  "the".  A  scan  to  the  right  of  the  noun  for  prepositions  and 
verbs  fails  to  find  any  word  candidates.  This  means  that  the  verb  "supports"  is  too  garbled  to 
be  picked  up  either  by  the  initial  scan  or  by  the  higher-plionetic-distance  selective  scan.  Similar 
scans  around  the  adjective  "green"  uncover  the  words  "dictionary"  and  "the",  but  again  fail  tc 
find  the  verb. 

After  hese  higher-distance  scans  have  taken  place,  the  system  builds  up  local  parse  struc¬ 
tures  in  the  sentence,  it  all  the  words  in  the  sentence  have  been  found,  then  the  entire  sentence 
is  constructed.  Otherwise,  the  result  is  a  number  of  partial  parse  trees  in  different  parts  of 
the  sentence. 

The  system  attempts  to  construct  as  many  such  partial  parse  trees  as  it  can  with  the  words 
it  lias  found.  In  this  example,  it  constructs  only  two  trees.  The  first  tree  (STRCH  the  large 
coffeetable)  is  straightforward;  the  second  tree  (the  green  dictionary  ENDCH)  is  somewhat 
unusual  since  it  contains  an  "ancestor  link"  (the  link  labeled  *).  Such  ancestor  iinks  are  neces¬ 
sary  in  ’  PARS  because  of  a  fundamental  problem  of  local  parsing:  namely,  that  it  is  not  always 
clear  fi  n  a  small  amount  of  local  context  what  the  global  relationship  is  between  two  adjacent 
syntactic  units.  In  this  case,  the  exact  relationship  of  the  noun  phrase  to  the  end-of-sentence 
character  is  not  clear.  Or.  the  other  hand,  for  the  nurposes  of  ,  uilding  local  structure,  LPARS 
need  not  kaow  the  actual  relationship.  The  ancestor  link  allows  LPARS  to  .ecognize  that  many 
syntactic  relationships  are  possible,  but  to  defer  commitment  until  later  when  an  attempt  is  made 
to  join  this  structure  to  other  structuies  by  proposing  words  between  them. 

Tne  third  and  final  step  in  the  parsing  process  consists  of  connecting  partial  parse  tre~s  to 
one  another  by  using  the  grammar  to  propose  words  that  might  exist  between  them.  Any  words 
proposed  are  tested  against  the  input  at  even  higher  phonetic  distances  than  the  previous  scans. 

In  the  example  given  here,  the  algorithm  discovers  that  the  two  parse  trees  can  be  connected 
by  a  verb.  It  therefore  initiates  a  higher-distance  scan  which  succeeds  in  finding  the  verb  "sup¬ 
ports".  Thus,  the  entire  sentence  is  recognized. 
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One  of  the  advantages  of  the  local  approach  Is  that  this  algorithm  can  attempt  to  connect 
the  most  promising  partial  parse  trees  first  (for  instance,  the  longer  ones).  In  this  way,  the 
local  organization  allows  the  system  to  take  advantage  of  the  maximum  amount  of  well-matching 
information  from  all  parts  of  the  sentence  to  direct  attention  to  areas  where  highly  garbled 
words  might  exist,  and  to  indicate  what  words  might  reasonably  make  sense  there.  A  left-to- 
right  parser  would  be  unable  to  take  advantage  of  such  information. 

Notice  that  this  example  is  a  simplified  one  —  no  erroneous  words  were  found.  In  a  more 
realistic  example,  erroneous  words  would  be  found,  some  local  structures  containing  these 
words  would  be  built  up  and,  additionally,  some  erroneous  local  parsings  of  the  correct  words 
could  be  constructed,  'opefully,  attempts  to  build  out  upon  the  erroneous  structures  would 
eventually  prove  unsuccessful,  while  attempts  to  build  out  upon  the  correct  structures  would 
us  sally  succeed. 

Notice  also  that,  in  this  example,  semantics  have  not  beer,  mentioned.  Semantics  enter 
i  ito  LPARS'  analysis  primarily  by  reducing  the  number  of  words  that  need  be  sought  by  the 
hi-her-distance  scans.  For  instance,  tests  for  adjectives  preceding  a  noun  need  only  look  for 
adjectives  that  may  legitimately  modify  that  noun. 

4.  Summary  Overview 

Figure  1-3  gives  a  condensed  overview  of  the  procedure  we  have  just  worked  through. 
LPARS  receives  its  input  in  the  form  of  phoneme  candidates.  It  makes  an  initial  scan  at  a  low 
phonetic  di  'ance  yielding  a  list  of  word  candidates,  some  of  which  may  be  incorrect.  It  uses 
these  to  initiate  higher-distance  scans,  to  construct  partial  parse  trees,  and,  if  lucky,  to  un¬ 
cover  the  entire  sentence.  Finally,  LPARS  investigates  ways  to  connect  the  partial  pai-se  trees 
together  to  build  up  the  whole  sentence.  Section  VI  describes  this  process  in  more  detail. 

INPUT 


sentence 

candidates 


Fig.  1-3.  Overview  of  LPARS. 
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F.  RELATED  WORK 


Two  useful  surveys  of  speech  literature,  with  extensive  bibliographies,  are  given  in  Refs  1 
and  2.  Previous  work  related  to  the  present  research  can  be  separated  into  four  fairly  distinct 
topics. 

1.  Isolated-Word  Recognition 

Numerous  different  approaches  have  been  taken  to  the  problem  of  recognizing  single  words 
spoken  in  isolation,  given  a  fairly  small  voo-oulary.  Gold,"’  and  Bob  row  and  Klatt^  achieved  86- 
to  95-percent  recognition,  each  using  a  vocabulary  of  54  words.  In  both  cases,  recognition  was 

done  without  trying  to  segment  and  identify  the  phonemes  that  made  up  the  utterance. 

7  8 

Medress  and  Vicens  developed  systems  which  performed  the  intermediate  step  of  attempt¬ 
ing  phoneme  identify  at  ion  before  trying  to  identify  the  word  spoken.  These  systems  achieved 
recognition  accuracy  imilar  to  that  of  Gold  and  Bobrow.  The  Vicens  system  used  a  vocabulary 
of  56l  words. 

In  the  present  research,  we  assume  that  input  to  LPARS  is  in  the  form  which  is  output  by 
the  Medress  and  Vicens  systems:  a  string  of  phoneme  candidates  which  may  contain  large 
amounts  of  ambiguity  and  error. 

2.  Continuous-Speech  Recognition 

g 

Vicens  also  coupled  his  phoneme  recognizer  to  a  very  primitive  continuous -speech  system 
which  worked  by  making  an  initial  scan  through  the  utterance  searching  for  a  certain  highly  recog¬ 
nizable  word,  ard  then  used  a  very  constrained  syntax  together  with  a  very  small  vocabulary  to 
guide  its  search  for  the  rest  of  the  words  in  the  utterance.  LPARS  can  be  thought  of  as  an  exten¬ 
sive  generalization  of  this  simple  system, 
q 

Reddy  recently  demonstrated  a  system  which  recognizes  spoken  chess  moves,  expressed 
in  a  simple  nonrecursive  syntax. 

The  system  of  Tappert  and  Dixon^’11  achieved  good  results  by  attempting  to  identify  "trans- 
ernes"  (phoneme-phoneme  transitions)  before  matching  the  utterance  as  a  whole.  Rabinowitz,12 
working  with  Tappert  and  Dixo.i,  experimented  with  a  left-to-right  continuous -speech  recognizer 
for  a  simple  nonrecursive  syntax,  with  success  rates  ranging  from  65  to  90  percent. 

3.  Natural- Language  Processing 

Most  work  done  in  recognition  of  natural-language  input  has  concentrated  on  left-to-right 
algorithms.  The  work  of  Woods1  ' 14  and  Winograd^  ...  'latural-language  parsing  is  closely  re¬ 
lated  to  the  present  research. 

Woods  uses  the  fo-  malism  of  an  augmented  transition  net  grammar  to  capture  the  regulari¬ 
ties  of  syntactic  structure.  Our  work  also  uses  this  augmented  transition  net  formalism,  but  it 
is  incorporated  into  a  local  rather  than  f  left-to-right  framework. 

Winograd  uses  PROGRAMMAR  procedures  and  a  set  of  systemic  syntactic  features  to  embody 
his  parsing  logic.  The  syntactic  features  used  in  our  work  are  very  similar  to  Winograd's. 

4.  Vertical  Systems 

The  M.I.T.  AI  Laboratory  vision  system  uses  the  basic  idea  of  a  vertical  system  (see  Ref.  3) 
which  allows  global  structure  to  be  u.ied  to  help  guide  the  more  local  processing,  thus  saving  a 
great  deal  of  unnecessary  work.  In  recognizing  a  scene  of  M^nks,  for  instance,  not  all  lines 
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must  be  recognized  by  the  line  recognizer  before  higher-level  analysis  is  invoked.  In  such 
situations  higher-level  analysis  can  propose  possible  missing  lines  and  direct  finer  statistical 
tests  to  look  for  them. 
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An  interesting  part  of  Winograd's  system  is  its  ability  to  call  on  semantic  analysis  with 
partially  recognized  structures,  in  a  vertical  fashion,  allowing  semantics  to  abort  nonsensical 
local  parses  early  in  the  analysis.  Thus,  semantics  plays  a  confirmational  role,  answering 
"ycs-no*  questions,  but  does  not  exercise  overt  control. 

In  LPARS,  semantics  performs  a  more  active  function  by  suggesting  possible  action  in  a 
"creative*  role,  such  as  supplying  a  small  list  of  word  possibilities  to  test  locally  against  a 
specific  section  of  the  input. 

G.  OVERVIEW  OF  SECTIONS  II  THROUGH  VIII 

Section  II  describes  the  transition  network  formalism,  the  set  of  syntactic  features  used  to 
augment  it,  and  the  fairly  simple  semantic  component  of  LPARS.  The  transition  network  for¬ 
mal!'  n  formti  the  backbone  of  LPAK3'  parsing  activity. 

The  various  sub-parts  of  the  LPARS  system  are  described  in  Secs.  HI,  IV,  and  V.  Section  III 
describes  how  "scrambled"  input  is  prepared  for  LPARS  by  a  front-end  simulator,  and  how  word 
recognition  is  performed.  Section  IV  describes  the  ir.tial  stages  of  LPARS'  processing:  the 
initial  scan,  and  the  fairly  local  higher-phonetic-distance  matching.  It  also  discusses  some 
interesting  way3  in  which  this  part  of  LPARS  could  be  extended.  The  two  main  algorithms  used 
by  LPARS  in  its  local  syntactic  processing  are  outlined  in  Sec.  V. 

Section  VI  describes  the  overall  coordination  of  the  LPARS  system,  and  discusses  some 
basic  problems  and  considerations  hi  the  design  of  a  locally  organized  system  such  as  l.PARS. 

The  experimental  evaluation  of  LPARS  is  described  and  seme  sample  runs  are  exhibheu 
in  Sec.  VII. 

Finally,  Sec.  VIII  summarizes  the  various  issues  which  have  been  dealt  with  in  developing 
the  local  approach  toward  parsing,  and  discusses  some  interesting  implications  of  the  approach. 
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11.  A  TRANSITION  NETWORK  GRAMMAR  WITH  SYNTACTIC  FEATURES 

LPARS  uses  an  augmented  transi'iou  network  grammar  as  the  basis  for  its  syntactic  activity. 

The  transition  network  formalism  used  by  LPARS  is  somewhat  different  from  previous  transition 

1  3  1.4 

net  parsers  such  as  Woods'.  ’  A  few  of  these  differences  are  primarily  in  form:  others  are 
significant  changes  made  to  accommodate  the  local  nature  of  LPARS1  operation. 

We  now  describe  how  LPARS  has  adapted  the  transition  net  grammar  to  accommodate  its 
local  parsing  purposes.  Section  A  introduces  *he  transition  retwork  formalism;  Sec.  B  describes 
how  LPARS  augments  its  transition  net  grammar  by  a  set  of  syntactic  features,  and  discusses 
some  of  the  advantages  gained  thereby;  in  Sec.  C,  the  syntax  used  by  LPARS  i3  presented,  and 
its  restrictions  are  discussed;  Sec.  D  describes  the  semantic  component  of  LPARS;  and  finally 
the  action  routines  which  are  associated  with  the  arcs  of  the  grammar  are  discussed  in  Sec.  E. 
The  action  routines  keep  track  of  the  syntactic  features  and  examine  the  semantic  features  of 
structures  bein"  parsed,  to  enforce  consistency  between  different  parts  of  the  sentence 

Two  basic  problems  are  confronted  here: 

(1)  How  to  organize  a  grammar  so  that  recognition  car.  be  started  easily  anywhere 
within  a  sentence 

(2)  How  to  organize  a  grammar  so  that,  when  local  structure  building  occ.rs,  any  po¬ 
tentially  exponential  inefficiencies  are  kept  within  reasonable  limits. 

A.  TRANSITION  NET  FOP.MAUSM 

This  section  describes  the  basic  structure  of  a  transition  net  grammar.  The  description  is 
kept  quite  informal;  however,  as  the  chapter  progref  ses  and  more  detailed  examples  are  pre¬ 
sented,  the  nature  of  the  transition  network  as  used  in  .’.PARS  will  become  more  concrete. 

A  transition  net  grammar  consists  of  a  number  of  finite-state  networks  (nets)  each  defining 

a  nonterminal  syntactic  class  (i.e.,  noun  phrase,  sentence,  etc.).  The  idea  of  representing  non- 
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tei  oinal  syntactic  classes  by  such  transition  diagrams  was  previously  developed  by  Woods, 
whom  'e  quote: 

"The  transition  network  grammar  model  is  an  extension  of  the  notion 
of  state  transition  diagram  well-known  to  automata  theory.  A  transi¬ 
tion  network  grammar  consists  of  a  network  of  nodes  with  arcs  con¬ 
necting  them.  The  nodes  represent  states  of  a  hypothetical  parsing 
machine  and  the  arcs  connectmg  them  represent  possible  transitions 
and  are  labelled  with  the  types  of  events  in  the  environment  of  the  ma¬ 
chine  which  permit  transitions.  In  the  case  of  a  transition  network 
grammar,  the  types  of  events  are  the  occurrences  of  words  and  phrases 
in  the  input  string  upon  which  the  grammar  is  operating." 

Figure  11-1  giv^s  a  rimple  example  of  such  a  grammar. 

Each  net  is  made  up  of  a  number  of  states  connected  by  arcs.  Each  arc  is  labeled  either 
by  the  name  of  a  terminal  syntactic  class  (noun,  verb,  determiner,  etc.),  or  by  the  name  of  a 
nonterminal  Term.nal  arcs  cc  espond  to  transitions  permitted  by  a  single  word  being  proc¬ 
essed  in  the  input  stream.  Nonterminal  arcs  require  the  recursive  invocation  of  a  nonterminal 
net  to  recognize  a  phrase  or  clause.  In  LPARS,  an  arc  can  also  be  labeled  by  LAMBDA;  this 
arc  allows  an  immediate  transition  without  processing  any  input.  Each  net  contains  at  least  one 
initial  state  (marked  £)  )  and  at  least  one  final  state  (marked  0  ). 
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Fig. 11-1. 

Associated  with  ea^h  arc,  and  with  each  initial  and  each  final  state,  is  an  action  routin' 
which  makes  tests  to  determine  when  a  transition  is  permitted  and  when  a  state  can  be  an  initial 
or  a  f  nal  state.  These  tests  keep  track  of  a  set  of  syntactic  features  and  other  information  about 
the  constructs  being  parsed  Thus,  the  transition  net  is  r  augmented"  by  the  activity  of  these 
action  routines.  The  operation  of  these  routines  is  discussed  in  Sec.  Il-E. 

Thus,  for  each  nonterminal,  a  transition  network  defines  the  structures  wt  'jh  can  form 
that  nonterminal.  For  instance,  in  the  simple  grammar  above,  a  noun  phrase  (NP)  is  defined  as 
<  determiner  (DET),  followed  by  zero  o»*  more  adjectives  (ADJs),  followed  by  a  noun,  and  fol¬ 
lowed  optionally  by  a  prepositional  phrase  (PP).  For  a  string  of  words  to  be  accepted  by  a  transi¬ 
tion  net,  that  string  must  permit  a  sequence  of  transitions  through  the  net  beginning  on  an  initial 
state  and  ending  on  a  final  state.  To  allow  this,  the  action  routines  associated  with  the  arcs 
traversed,  and  with  the  initial  and  final  states,  must  permit  the  sequence  of  transi'ions. 

In  a  left-to-right  parser,  these  transition  nets  can  be  t.iought  of  as  a  flowchart  of  the  parser's 
action  when  analyzing  a  sentence.  In  a  system  such  as  LPARS,  which  performs  local  parsing, 
the  transition  nets  are  best  thought  of  rather  as  defining  acceptable  left-to-right  adjacencies  for 
syntactic  structure. 

B.  SYNTACTIC  FEATURES 

Central  to  the  LPARS'  syntactic  formalism  are  a  set  of  ■*— tactic  features  with  which  LPARS 
augments  the  transition  net  grammar.  Syntactic  features  mak  *  easier  to  organize  the  grammar 
so  that  parsing  can  be  started  in  the  middle  of  a  transition  .  yet  still  proceed  in  in  organized 
fashion.  Syntactic  features  also  allow  a  quite  compact  grammar  to  be  created  This  section  de¬ 
scribes  how  '.hese  syntactic  features  are  used  and  why  they  are  important  To  this  end,  we  first 
work  through  r  simplified  example. 

1.  Example:  Relative  Clauses 

A  relative  clause  can  be  thought  of  as  a  permutation  of  the  structure  of  an  English  sentence 
In  transformational  theory,  a  relative  clause  is  thought  of  as  being  formed  by  moving  a  noun 
phrase  to  the  front  of  a  sentence,  replacing  it  with  a  relative  pronoun  For  example,  if  one  as¬ 
sumes  the  following  simple  underlying  structure  for  a  sentence: 

NP-V-NP-PP  (noun-phrase,  verb,  noun-phrase,  prepositional-phrase) 
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as  in  "the  man  places  the  book  on  the  table",  then  the  relative  clauses  shown  in  Fig.  11-2  can  be 
created  (In  this  figure,  the  terminal  WH  represents  a  relative  pronoun:  who,  whom  or  which.) 

ORIGINAL  SENTENCE: 


0. 

NP 

V  NP 

PP 

(declorotive) 

RELATIVE  CLAUSES: 

1. 

WH 

V  NP 

PP 

(subject  r.c.) 

2.  Wh 

NP 

V 

PP 

(object  r.c.) 

3.  PREP  WH 

NP 

V  NP 

Fig.  11-2. 

' 

(prepositiono!  r.c 

In  structure  1  (WH  V  NP  PP),  the  first  NP  (already  at  the  start  of  the  clause)  is  relativ¬ 
ized:  as  ir.  .  v;ho  places  the  book  on  the  table.  .  .  ". 

In  structure  2  (W11  NP  V  PP),  the  second  NP  is  brought  to  the  head  of  the  clause  and  rel¬ 
ativized:  as  in*...  which  the  man  places  on  the  table.  .  .  ". 

In  structure  3  (PREP  WH  NP  V  NP),  the  third  NP  together  with  the  preceding  preposition 
is  brought  to  the  head  of  the  clause  and  relativized:  as  in  " .  .  .  on  which  the  man  places  the 
book. . .  ".  Notice  that  a  fourth  structure  can  also  be  formed  with  the  preposition  re  aining  at 
the  end  of  the  clause:  as  in  ".  .  .  which  the  man  places  the  book  on.  .  .  ". 

These  three  relative  clause  structures  can  be  labeled  "subject  relat;ve  clause"  (SRC), 
"object  relative  clause"  (ORC),  and  "prepositional  relative  clause"  (PRC);  these  names  indicate 
the  noun  phrase  in  the  original  declarative  (DCL)  form  which  was  relativized.  Later  in  this  ex¬ 
ample.  these  names  are  used  as  "syntactic  features"  to  help  guide  the  parsing  process. 

Thus,  Fig.  11-2  outlines  in  tabular  form  a  set  of  "linguistic  facts"  describing  the  structure 
of  a  declarative  sentence  and  of  three  types  of  relative  clauses.  Figure  II— 3  shows  a  possible 
way  that  these  structures  might  be  represented  in  a  transition  net  grammar. 


V  NP  PP 


Fig.  11-3. 

In  this  transition  network,  each  structure  is  spelled  oat  by  a  separate  branch  of  the  network. 
Notice,  however,  that  there  is  a  great  deal  of  similarity  between  the  different  branches  This 


approach  to  representing  syntax,  although  inelegant,  woi  id  probably  not  be  too  costly  in  a  left- 
to-right  parser,  since  the  parser  would  mt  progress  too  far  down  a  dead-end  branch  before 
backing  up. 

For  LPARS,  however,  such  an  approach  would  ^e  very  inefficient  because,  due  to  its  local 
parsing  activity,  the  syntactic  part  of  LPARS  must  'jonstruct  as  many  valid  parse  structures  as 
possible  from  the  words  passed  to  it.  Since  LPAL'S  has  no  way  of  knowing  the  correct  left-to- 
right  context  in  which  a  set  of  words  might  exist,  it  must  assume  that  any  context  is  possible. 
Thus,  for  example,  if  it  finds  a  noun  phrase  to  the  left  of  a  verb,  LPARS  must  construct  one 
parse  tree  for  every  instance  in  the  grammar  where  an  NP  arc  adjoins  a  verb  arc.  If  there  are 
thirty  such  instances,  then  thirty  parse  structures  must  be  created. 

It  would  be  inefficient  for  LPARS  to  represent  a  good-sized  subset  of  the  different  structural 
permutations  of  an  English  sentence  by  "spelling  out"  each  permutation  as  in  Fig.  11-3.  Such  a 
system  would  not  only  be  inelegant,  but  would  not  allow  efficient  extension.  The  inherent  ineffi¬ 
ciencies  would  rise  exponentially  as  more-anu-more  syntactic  structure  was  incorporated 

LPARS  avoids  this  problem  by  using  syntactic  features  which  allow  us  to  collapse  a  number 
of  different  permutations  of  a  sentence,  for  instance,  onto  a  3ingle  transition  net  skeleton.  Let 
us  see  how  this  is  done,  continuing  the  relative  clause  example. 

The  linguistic  facts  portrayed  in  Fig.  11-2  can  be  described  using  a  transition  net  with  four 
syntactic  features:  DCL,  SRC,  ORC,  PRC,  standing  for  each  of  the  four  possible  sentence  per¬ 
mutations.  Each  syntactic  feature  represents  a  particular  phenomenon  which  might  or  might  not 
be  present.  Therefore,  each  syntactic  feature  has  associated  with  it  a  Boolean  value  of  either 
or  "  indicating  whether  that  phenomenon  might  be  present  or  not. 

Let  us  see  how  s  >ch  features  can  be  incorporated  into  a  transition  net.  Figure  1 1 -4  shows  a 
transition  net  for  the  declarative  sentence  The  feature  (DCL+)  is  associated  with  each  arc.  (In 
the  LPARS  system,  these  features  are  actually  built  into  the  action  routines  which  arc  associated 
with  each  arc.) 


Fig.  11-4. 

Now  let  us  incorporate  the  subject  relative  clause  structure  into  this  net.  To  do  this,  we 
add  an  arc  labeled  \VH  with  the  features  (DCL-,  SRC+)  and  label  the  other  arcs  with  the  appre- 
priate  SRC  feature  as  shown  in  Fig.  11-5. 


DCL+  DCL+  DCL+  DCL+ 

SRC*  SRC+  SRC+  LRC+ 


WH  \ 


DCL- 
SRC  + 


Fig.  II-5, 
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The  syntactic  features  associated  with  eact  ai  c  represent  tests  which  must  be  performed 
in  the  action  routine  associated  with  the  arc,  Exac'-ly  how  these  tests  operate  is  described  in 


Sec,  1I-E, 

Now  let  us  add  the  object  relative  clause  structure.  This  can  be  done  by  adding  a  second 
possible  initial  state,  a  WH-labeled  arc  from  that  stite  to  the  first  initial  state,  and  a  uAMBDA 
arc  to  bypass  the  second  NP,  as  shown  in  Fig,  11-6, 


DCL- 

DCL+ 

DCL+ 

DCL+ 

DCL+ 

SRC- 

SRC- 

SRC+ 

SRC  + 

SRC+ 

ORC+ 

ORC+ 

ORC+ 

ORC- 

ORC+ 
PP  f 

\  WH 

NP  / 

JlL 

NP  / 

0  l 

tjy 

t: 

pjy 

5> 

/  WH 

/ 

U  MBDA 

DCL-  DC  L- 


SRC+ 


SRC- 


ORC- 


ORC  + 


Fig,  11-6. 


A  LAMBDA  arc  is  one  which  can  be  traversed  without  parsing  any  input.  Notice  that  it  is 
only  necessary  to  add  a  small  amount  of  structure  in  the  form  of  states  or  arcs  when  each  new 
structural  permutation  is  incorporated  into  the  existing  net.  Most  of  the  structure  is  carried  in 


the  syntactic  features  associated  with  the  arcs. 

Finally,  the  prepositional  relative  clause  structure  is  added  as  shewn  in  Fig.  11-7.  Here, 
an  additional  initial  state  and  an  arc  labeled  PREP  are  added,  and  a  previous  state  is  made  a 


possible  final  state. 


DCl- 

SRC- 

ORC- 


DCL- 

SRC- 

ORC+ 


DCL+ 
SRC- 
ORC  + 


DCL+ 

SRC+ 

ORC+ 


DCLf 

SRC+ 

ORC- 


DCL+ 

SRC+ 

ORC+ 


ORC- 

PRC- 


ORC+ 

PRC- 


Fig.  11-7, 


To  make  this  net  complete,  a  set  of  initial  and  final  state  tests  must  be  created  to  indicate 
which  permutation  can  begin  and  end  on  the  various  initial  and  final  states.  Thus,  the  following 
feature  sets  must  be  associated  with  the  initial  and  final  states; 
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Initial  States 


1. 

(DCL-  SRC-,  ORC-, 

FRCH 

(DCL-,  SRC—,  ORC  +  , 

PRC—) 

3. 

(DCL+,  SRC+,  ORC-, 

PRC-) 

Final  States 

6. 

(DCL.-,  SRC  -,  ORC  — , 

PRC+) 

7. 

(DCL+,  SRC+,  ORC+, 

PRC—) 

This  new  net  captures  the  same  set  of  linguistic  facts  as  Figs.  11-2  or  11-3. 

The  list  of  features  associated  with  each  arc  can  be  considered  an  elimination  list.  In  this 
simple  example,  if  parsing  is  proceedir  from  left  to  right  along  the  arcs,  one  can  keep  a  fea¬ 
ture  list  and  cross  off  any  feature  rri  l  ed  !l  on  the  arc  that  one  is  about  to  take.  If  all  features 
are  crossed  off,  then  that  arc  canr.ot  be  taxen;  if  not,  then  any  features  that  remain  are  possible 
global  interpretations  for  the  structure  parsed  so  far.  In  LPARS,  the  tests  which  keep  track  of 
these  syntactic  features  are  performed  in  the  action  routines  (as  descr  d  in  Sec.  11- E) . 

Notice  that  the  parsing  process  can  be  started  at  any  place  in  the  net  (not  just  at  an  initial 
state),  and  still  proceed  in  a  syntactically  meaningful  fashion.  Notice  also  that  this  new  transi¬ 
tion  net  does  not  have  the  undesirable  features  cf  the  net  in  Fig.  11-2.  If  a  noun  phrase  is  found 
to  the  left  of  a  verb,  only  one  parse  structure  need  be  created,  as  shown  in  Fig.  11-8. 


SENT _ 

NP~  ~VERB 

A 

Fig.  11-8. 

Section  V  describes  the  parse  structures  created  by  LPARS  in  more  detail.  This  parse 
structure  would  have  associated  with  it  the  syntactic  features  (DCL+,  SRC—,  ORC+,  PRC+)  in¬ 
dicating  that  it  could  be  part  of  a  declarative  sentence  or  of  an  object  or  prepositional  relative 
clause.  As  parsing  continues,  these  features  restrict  the  arcs  which  can  be  taken. 

In  this  way,  syntactic  features  provide  a  way  of  letting  structure  recognized  in  one  part  of 
a  sentence  influence  parsing  done  in  another  part  of  the  sentence,  while  keeping  a  very  stream¬ 
lined  transition  net  skeleton. 

The  crossing  off  of  syntactic  features  in  this  simple  example  was  very  straightforward.  In 
a  more  complex  example,  the  logic  in  the  action  routines  might  involve  more  complex  tests  be¬ 
fore  crossing  off  a  given  feature  Thus,  more  complex  functions  can  br  associated  with  the  arcs 
to  test  the  syntactic  features. 

The  basic  concept  of  keeping  a  list  of  syntactic  features,  which  indicate  different  possible 
global  interpretations  for  the  structure  parsed  up  to  a  given  point,  is  a  very  general  one  and  is 
extendable  to  a  wide  range  of  syntactic  phenomena. 

There  are  two  major  advantages  gained  by  using  syntactic  features:  (1)  They  allow  the  gram¬ 
mar  to  be  expressed  compactly,  thus  cutting  down  on  potentially  exponential  inefficiency;  and 
(2)  the  syntactic  feature  logic  tends  to  have  little  left-to-right  bias,  and  therefore  makes  it  eas¬ 
ier  to  create  a  system  in  which  parsing  can  start  at  any  r.cde  within  n  network,  yet  still  proceed 
in  a  well-defined  fashion. 


2  Tradeoffs  in  Grammatical  Complexity 

In  U3ing  syntactic  features  to  achieve  a  streamlined  syntactic  skeleton,  we  are  really  trading 
off  betwee.i  complexity  of  the  transition  net  structure  (arcs  and  states)  and  complexity  of  the  logic 
in  the  action  routines  that  process  syntactic  features.  This  tradeoff  allows  a  reduction  in  the 
number  of  partial  parse  structures  which  must  be  constructed,  but  increases  the  amount  of  in¬ 
formation  (in  the  form  of  syntactic  features)  v/hich  must  be  associated  with  the  parse  structures. 
For  the  purposes  of  l .PARS,  this  tradeoff  is  a  useful  one. 

3.  Generality  of  This  Approach 

As  described  above,  l.PARS  uses  a  transition  net  grammar  augmented  by  syntactic  features 
as  the  basis  for  its  parsing.  The  grammar  which  LPAPS  uses  is,  however,  a  very  rc  cricted 
one.  It  is  therefore  worth  discuss' ng  how  general  and  e/tendible  the  approach  is. 

This  section  broaches  this  issue  by  first  relating  syntactic  features  to  transformational 

15  16  4 

grammar  ’  and  then  to  the  systemic  grammar  used  by  T.  Winograd  in  his  natural  language 

parser.  It  turns  out  that  syntactic  features,  if  created  by  looking  at  a  set  of  transformations, 

can  be  related  quite  directly  to  these  transformations.  Syntactic  features  are  also  very  similar 

to  the  systemic  grammar  features  used  by  Winograd.  These  comparisons  are  made  to  help  put 

the  syntactic  features  discussed  in  this  report  into  perspective  with  respect  to  other  syntactic 

representations. 

a.  Syntactic  Features  vs  Transformations 

A  transformational  grammar  induces  a  set  of  rules  which  take  the  "deep  structure"  of  a 
sentence  (its  underlying  form)  and  successively  transform  it  to  generate  a  "surface  structure" 

(the  form  which  would  be  spoken  or  written).  Most  linguistic  theory  is  couched  in  this  generative 
transformational  framework. 

This  section  argues  that  a  transition  network  with  a  set  of  syntactic  features  can  be  related 
quite  directly  to  a  transformational  grammar,  if  the  transition  network  is  built  up  by  systemat¬ 
ically  applying  different  .ransformations  to  a  simple  sentence  structure.  Section  Il-B-1  gave  a 
simple  example  of  how  this  can  be  done  for  relative  clause  formation. 

It  is  not  a  trivial  task  to  change  a  transformational  grammar  into  an  equivalent  transition 
net  grammar  augmented  by  syntactic  features.  There  are,  however,  a  number  of  principles 
which  can  serve  as  guidelines.  The  simplest  principle  is  first  to  take  a  transition  net  representa¬ 
tion  of  a  declarative  sentence  (a  simple  example  is  shown  in  Fig.  11-9). 


NP  V  NP  PP 


Fig.  11-9. 


The  transformations  are  then  taken  one-by-one  in  the  order  in  which  they  can  be  applied 
and,  in  a  heuristic  fashion,  appropriate  str’Tture  is  added  to  the  transition  net  for  each  trans¬ 
formation  in  turn.  A  very  simple  example  of  how  this  can  be  done  was  given  in  Sec.  1I-B-1.  In¬ 
corporating  the  effects  of  a  transformation  into  a  transition  net  structure  involves  usually  adding 
some  arcs,  sometimes  adding  additional  states,  and  always  adding  one  or  more  syntactic  features. 
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This  incj  nental  incorporation  of  transformations  was  used  to  create  the  transition  net 
grammar  of  LPARS.  It  is  not  clear  how  easily  one  can  create  an  efficient  transition  net  for  a 
complex  grammar  by  this  process.  The  LPARS  grammar  does  not,  for  instance,  incorporate 
any  of  the  complex  cyclic  c"'cts  that  can  arise  in  some  transformations .  In  general,  the  crea¬ 
tion  of  an  efficient  transition  net/syntactic  feature  grammar  from  a  set  of  transformations  is 
probably  more  a  t  art  than  a  science,  although  one  might  be  able  to  automate  the  process. 

The  basic  point,  however,  is  that  a  transition  net  grammar  with  syntac:ic  features  can  be 
written  so  that  its  creation,  and  therefore  its  structure,  bears  quite  a  direct  correlation  to  a 
set  of  transformations. 

An  incidental  point  in  this  regard  is  that  LAMBDA  transitions  can  model  quite  naturally  the 
effect  of  a  transformation  which  moves  a  structure  from  one  part  of  a  sentence  to  another.  For 
example,  let  us  assume  that  a  transformation  can  move  a  NP  from  one  place  to  another.  V/e 
can  create  a  feature  labeled  A  to  represent  th‘s  change,  and  then  put  the  structure  shown  in 
Fig.  11-10  into  the  transition  net. 


A+,  B-  A-,  B+ 

•  •  » 

A-,  B+  A+,  15- 

Fig.  11-10. 

If  the  transformation  can  move  the  NP  from  position  2  to  position  1,  the  feature  ( A  + )  impliei 
that  the  transformation  has  been  applied,  and  <B + )  implies  that  it  hi.s  not.  When  parsing,  if  the 
NP  is  recognized  in  position  1.  then  the  features  (A+,  B-)  imply  that  the  LAMBDA  transition 
must  be  taken  later.  Similarly,  if  the  LAMBDA  transition  is  taken  in  posi,;on  1,  the  features 
(A—,  B+)  imply  that  the  NP  arc  must  be  taken  in  position  2  (An  illustration  of  this  very  phe¬ 
nomenon  was  seen  in  the  relative  clause  example.  The  ORC  feature  implies  that  the  direct  ob¬ 
ject  NP  has  been  moved  to  the  front  of  the  sentence  and  relativised.  Thus,  a  LAMBDA  transition, 
with  the  ORC  feature  labeled  "■*"  and  other  features  labeled  is  included  in  the  net  to  bypass 
the  direct  object  NP.) 

b.  Syntactic  Features  vs  Winograd1  s  Grammar 

4 

InT.  Winograd's  natural-language  parser,  a  set  of  "systemic  features"  was  used  which  was 
based  on  the  notion  of  a  systemic  grammar.  These  features  allowed  syntactic  constructs  which 
have  been  recognized  to  interact  with  parsing  done  later  in  the  sentence.  The  syntactic  features 
which  LPARS  uses  to  augment  its  transition  net  are  very  similar,  if  not  completely  identical  in 
spirit,  to  these  systemic  features. 

The  features  used  by  Winograd  incorporated  many  more  different  syntactic  structures  than 
does  the  limited  set  of  syntactic  features  used  by  l.PARS,  but  were  implemented  only  in  a  left- 
to-righ;  parser. 

It  would  probably  be  qui*e  straightforward  to  extend  the  LPARS  grammar  directly  to  incor¬ 
porate  the  systemic  set  of  features  used  by  Winograd. 


NP  / 

o 

LAMBDA  \ 
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c.  Ex'.endability  and  "Sjmtactic  Economy" 

The  basic  issue  in  discussing  the  extendability  of  the  LPARS  parsing  approach  is  not  really 
whether  syntactic  features  can  accommodate  a  wider  range  of  syntactic  structure.  We  feel  that 
Wiaograd's  system  demonstrates  that  this  is  possible.  The  real  isrue  is  whether,  at  the  same 
time,  the  system  can  retain  the  desired  "syntactic  economy"  in  the  sense  of  a  streamlined  transi¬ 
tion  net  skeleton. 

The  use  of  syntactic  features  allows  LPARS  to  represent  many  different  permutations  of  the 
structure  of  a  sentence  on  a  single  transition  net  skeleton.  Thus,  for  instance,  if  a  NP  is  found 
to  the  left  of  a  verb,  as  discussed  above,  only  one  syntactic  structure  need  be  created.  There¬ 
fore,  a  potential  exponential  explosion  is  avoided. 

It  is  certainly  conceivable  that  with  a  more  comprehensive  grammar,  such  a  dramatic  col¬ 
lapse  onto  a  single  syntactic  skeleton  might  be  difficult  to  achieve.  On  the  other  hand,  a  great 
deal  of  collapsing  of  syntactic  structure  onto  common  syntactic  framework  cp'  ilnly  would  be 
possible. 

Thus,  the  important  point,  from  the  standpoint  of  this  report,  is  that  the  use  of  syntactic 
features  to  augment  a  transition  net  is  a  very  powerful  tooi  which  allows  many  different  syntactic 
structures  to  be  combined  in  a  single  syntactic  skeleton,  and  which  dramatically  reduces  the  po- 
tentia1  exponential  inefficiencies  inherent  in  a  local  approach  to  parsing,  and  therefore  makes 
these  potential  inefficiencies  much  more  readily  managed  and  easier  to  tolerate. 

The  problems  of  ultimate  extendability  of  this  approrch  to  a  comprehensive  grammar  have 
not  been  systematically  explored. 

C.  LPARS'  GRAMMAR 

The  grammar  which  LPARS  uses  contains  the  four  syntactic  features  discussed  above,  along 
with  seven  others.  These  syntactic  features  allow  LPARS  to  accommodate  an  interesting,  al¬ 
though  admittedly  restricted,  subset  of  English  syntax. 

LPARS'  grammar  allows  the  system  to  recognize  the  following  sentence  structures:  declara¬ 
tive,  imperative,  the  three  relative  clauses  discussed,  yes-no  questions,  and  three  types  of 
Wll-type  questions.  The  system  accommodates  both  active  and  passive  forms  of  all  these  forms 
(except  the  imperative). 


i.  LPARS'  Syntactic  Features 

This  section  discusses  each  of  the  eleven  syntactic  features  used  by  LPAPS  and  demonstrates 
their  effect  in  sample  sentences.  Thus-  features  '■an  be  separated  into  two  groups,  or  "systems." 
of  features.  The  implication  of  the  separation  for  LPARS'  operation  is  diocussed  later. 


The  first  system  of  syntactic  features  is: 
DCL  (declarative) 

IMP  (imperative) 

QQ  (simp'e  question) 

SRC  (subject  relative  clause) 

ORC  (object  relative  clause) 

PRC  (prepositional  relative  clause) 


As  in  "The  table  supports  the  book" 
or  "The  man  puts  the  book  on  the  table". 

As  in  "Place  the  book  on  the  table". 

As  in  "Does  the  table  support  the  book?" 
As  in  "which  supports  the  book". 

As  in  "which  the  table  supports". 

As  in  "on  which  the  mail  places  the  book". 
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SQ  (subject  question)  \s  in  "What  supports  the  book?' 

OQ  (object  question)  As  in  "What  dees  the  table  support?' 

PQ  (prepositional  question)  As  in  "On  what  does  the  man  place  the 

book?' 

The  other  system  of  syntactic  features  includes  only  two  features:  ACT  (active  voice)  and 
PASS  (passive  voice). 

All  the  examples  given  above  illustrate  the  active  voice.  Sample  passive  sentenejs  are: 
"The  bcok  is  supported  ty  the  table.”  and  "What  is  placed  on  the  table  by  the  man?" 

2.  LPARS1  Transition  Network 

Figure  11-11  shows  the  set  of  transition  networks  used  by  LPARS;  they  define  four 
nonterminals: 

SENT  1  a  sentence,  bracketed  by  start  and  end  charactei  s 

SENT  a  sentence  in  any  of  the  different  permutations 

discussed  above 

NP  a  noun  phrase 

PP  a  prepositional  Dhrase 

(Notice  that  this  grammar  differs  slightly  in  form  from  some  grammars  sirce  there  is  no  "verb 
phrase"  nonterminal.  Rather,  the  SENT  nonterminal  has  incorporated  directly  the  verb  phiase 
structure.) 


SENT: 


PREP  NP 


Fig.  11-11.  I  PARS'  transition  network  grammar. 

The  grammar  also  allows  fourteen  terminal  syntactic  classes  which  can  be  grouped  into 
four  rough  categories: 
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(a)  Six  of  the  terminals  are  straightforward:  noun,  erb,  adjective  (ADJ),  determiner 
(DET),  preposition  (PREP),  proper  noun  (PROPN). 

(b)  F  jur  are  special  function  words: 

WH  —  WH-type  words  (such  as  "who",  "whom",  "which",  and  "what") 

DO  —  the  function  word  "docs". 

*3E  —  the  functiop  word  "i3". 

BY  —  the  function  word  us  id  to  meke  a  passive  form,  "by". 

(c)  Two  are  art’ficial  terminals  Included  to  let  the  grammar  know  whether  it  has  rec¬ 
ognized  all  the  input: 

STRCI1  -  the  beginning  cf  the  input 

ENDCll  —  the  end  of  the  input. 

(d i  The  final  terminal,  AFFIX,  is  really  a  subword  morph  (word  affix).  It  represents 
the  encmg  of  a  verb,  which  in  l.PARS  can  be  "s"  (active  present  tense),  "ed"  (pas¬ 
sive),  or  NIL  (imperative  or  root).  (LPARS  expects  all  verbs  to  be  regular.)  The 
AF'FIX  arc  is  treated  very  much  like  a  I.AMBDA  arc  by  most  of  LPARS’  algorithms. 

3.  Restrictions  of  LPARS’  Grammar 

As  is  now  probably  very  clear,  the  syntax  which  LPARS  rec  gnizes  is  quite  restricted. 

This  section  describes  some  of  these  restrictions  explicitly  and  discusses  why  they  were  made. 

(a)  Limited  sentence  structures:  The  emphasis  in  LPARS  is  to  develop  a  localized 
approach  to  parsing.  Therefore,  we  have  kept  the  syntax  quite  limited  to  allow  us 
to  locus  more  fully  on  the  basic  problems  involved  in  the  local  approach. 

(b)  Sma’l  Morph  Problems:  Due  to  the  crudity  of  current  phoneme  recognizers,  we 
decided  to  limit  the  amount  to  which  small  words  and  small  morphs  were  allowed. 
This  decision  results  in  a  number  of  restrictions:  only  singular  noun  phrases  are 
allowed  (therefore,  no  subject-verb  number  agreement  checks  are  necessary); 
only  present  tense  is  allowed,  although  both  active  and  passive  and  root  forms  of 
the  verb  can  be  used  (thus,  no  complex  affix  hopping  considerations  need  be  con¬ 
sidered';  also,  the  use  of  pronouns  is  not  hai'dled. 

(c)  To  make  tne  grammar  simpler  and  more  manageable,  a  few  other  restrictions  are 
made:  only  one  clause  can  modify  a  noun  (although  an  arbitrary  number  of  recur¬ 
sively  nested  modifying  phrases  is  allowed);  also,  conjunctions  are  not  allowed. 

(d)  A  further  restriction,  which  deserves  separate  mention,  is  that  left  recursion  is 
not  allowed  (i.e.,  allowing  a  nonterminal  to  start  with  itself).  The  local  parsing 
algorithms  discussed  in  Sec.  V  would  have  to  be  somewhat  mouified  to  accommodate 
left-recursive  structures. 

These  are  some  of  the  most  glaring  restrictions  which  LPARS  makes.  Clearly,  there  are 
numerous  complex  and  ..ot-so-complex  linguistic  constructs  which  LPARS  also  does  not  account 
for.  None  of  these  restrictions  is  made  necessary  by  any  inherent  incapability  of  the  LPARS 
approach.  Rather,  we  felt  that,  due  to  the  crudity  of  current  front-end  analyzers,  it  would  be 
overly  ambitious  to  tr--  to  incorporate  too  much  structure  at  this  time,  parti lularb'  structure 
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that  depended  toe  much  on  small  function  words  and  smiu.  morphs  such  as  noun  and  verb  endings 
Also,  the  primary  focus  of  the  present  work  is  on  developing  the  basic  structure  of  the  local  ap¬ 
proach,  in  a  limited  sys  ,em  using  techniques  that  are  general  and  extendable  Once  such  an  ap¬ 
proach  is  understood,  extension  is  a  much  easier  task. 

D  SEMANTIC  FEATURES 

This  section  describes  the  semantic  component  of  LPARS.  Consisting  primarily  of  semantic 

feature  checks,  LPARS'  semantic  processing  is  quite  limited.  Semantic  featu.-es  are  a  widely 
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used  means  of  incorporating  a  semantic  component  into  a  syntactic  formalism. 

Each  word  in  LPARS'  vocabulary  has  a  list  of  semantic  features  describing  attributes  of 
that  word.  Each  semantic  feature  consists  of  a  name  and  either  a  value  or  a  list  of  possible 
values. 

1.  Semantic  Feature  Checks 

The  feature  list  of  the  noun  "table"  includes  the  features  (AN1M  MINUS  SUPP-SUB  PLUS 
CONT-SUB  MINUS): 

ANIM  MINUS  means  that  a  table  is  inanimate. 

SUPP-SUB  PLUS  means  that  a  table  can  support  objects 
CONT-SUB  MINUS  means  that  a  table  cannot  contain  objects. 

The  feature  list  of  the  verb  "support"  includes  the  features  (OONT  MINUS  SUPP  PLUS): 

CONT  MINUS  means  that  the  action  is  not  that  of  containing. 

SUPP  PLUS  means  that  the  action  is  that  of  supporting. 

The  feature  list  of  the  preposition  "on"  similarly  includes  the  features  (CONT  MINUS  SUPP 
PLUS). 

Two  examples  of  typic.  semantic  feature  checks  involving  these  features  follow. 

Example  i . 

If  a  verb  has  the  feature  (SUPP  PLUS),  check  that  the  subject  of  that  verb  has  the  feat  :re 
(SUPP-SUB  PLUS).  Thus,  "The  kitchen  supports  the  book."  would  not  be  recognized  as  an  allow¬ 
able  sentence,  while  "The  table  supports  the  book."  would  be  accepted. 

Example  2. 

If  a  preposition  has  the  feature  (SUPP  PLUS),  check  that  the  object  of  that  preposition  has 
the  feature  (SUPP-SUB  PLUS).  Thus,  the  phrase  "on  the  kitchen"  would  not  be  recognized  as 
a  prepositional  phrase,  whereas  "on  the  tab):”  would  be  recognized. 

2,  Semantic  Feature  Checks  Used  by  LPARS 

There  are  five  different  general  cases  of  semantic  feature  agreement  enforced  by  LPARS. 
These  are  all  enforced  by  testing  appropriate  features,  namely: 

Subject-Verb  Agreement:—  The  subject  must  be  capable  of  performing  the  action  asso¬ 
ciated  with  the  verb.  If  the  sentence  is  passive,  of  course,  it  a  the  "deep-structure" 
subject  that  must  be  checked  for  this  agreement 
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The  table  supports  the  book. 

*  The  table  contains  the  book. 

The  book  is  supported  by  the  table. 

*  The  book  is  contained  by  the  table. 

The  man  places  the  book  on  the  table. 

*  The  sofa  places  the  book  on  the  table. 

Verb-Object  Agreement:—  The  object  must  be  capable  of  having  the  verb  action  per¬ 
formed  to  it. 

*  The  table  supports  ihe  kitchen. 

*  The  man  places  the  fireplace  on  the  table. 

Preposition-Object  Agreement:—  Since  a  preposition  denotes  a  relation,  the  object  cf 
a  preposition  must  be  able  to  maintain  that  _  ^aiionship  with  other  >bjects 

on  the  table 

*  on  the  kitchen 

*  in  the  table 

Modified  Noun-Preposition  Agreement:-  Similarly,  the  noun  modified  oy  a  preposi¬ 
tional  phrase  must  be  able  to  maintain  the  relationship  specified  by  the  preposition  with 
other  objects. 

*  the  fir«.  )lace  on  the  table 

♦  the  livingroom  in  the  bookcase 

Adjective-Noun  Agreement:-  Only  certain  adjectives  can  modify  a  noun. 

the  friendly  person 
*  the  green  person 
>  the  rectangular  man 
the  small  dish 

As  with  most  attempts  to  introduce  order  into  linguistic  phenomena,  it  i3  certainly  possible 
to  imagine  violations  of  these  simple  semantic  feature  constraints.  For  instance,  it  is  conceiv¬ 
able  that  in  free  speech,  one  might  refer  to  "the  rectangular  man."  As  a  result,  a  set  of  feature 
checks  of  the  sort  used  by  LPARS  is  useful  only  when  the  scene  being  talked  about  is  restricted 
and  well  understood. 

Section  II— E  describes  how  the  enforcement  of  these  semantic  feature  checks  fits  into  the 
transition  net  grammar. 

3.  Semantic  Features  vs  Syntax 

1 3 

It  has  been  argued  that  "semantic"  feature  constraints  of  the  sort  used  by  LPARS  (often 
called  "selectional  restrictions")  are  really  syntactic  in  character.  This  argument  can  be  under¬ 
stood  by  observing  that  one  can  transform  a  semantic  feature  grammar  into  an  equivalent  gram¬ 
mar  without  any  semantic  features  by  using  many  more  nonterminal  and  terminal  syntactic  classes. 
Instead  of  allowing  nonterminal  nodes  to  be  flagged  with  the  semantic  features  of  component  words, 
the  new  grammar  would  have  a  special  nonterminal  for  each  combination  of  semantic  features 
possible  in  the  old  grammar.  In  this  new  grammar,  all  the  semantic  feature  information  is  im¬ 
plicit  in  the  name  of  any  given  syntactic  class. 
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This  argument  is  a  valid  c.'“.  It  points  up  tne  limitations  of  the  semantic  processing  pres¬ 
ently  incorporated  into  I  PARS. 

4.  Extensions  to  the  Semantic  Pro'-vssing 

One  interesting  extension  of  LPaR3'  semantic  component  would  be  to  let  the  parsing  tak° 
place  in  the  context  of  a  particular  "scene"  being  talked  about.  Thus,  if  the  words  "the  dictionary 
on  the  table"  are  parsed,  these  would  only  be  formed  into  a  noun  phrase  with  a  modifying  prep¬ 
ositional  phrase  if  an  instance  of  such  an  object  existed  in  the  sr  ene  being  discussed. 

Similarly,  relative  clause  constructs  such  as  "which  the  man  placed  on  the  table”  would  only 
be  recognized  if  some  event  had  occurred  in  the  course  of  previous  interaction  which  could  be 
related  to  the  event  bem.g  described  in  the  clause. 

This  extension  of  the  system  could  be  a  major  project.  Yet  it  would  fit  quite  naturally  into 
the  overall  I. PARS  framewor  Such  an  approach  would  involve  using  semantic  processing  of  a 
clearly  non-syntactic  variety. 

E.  ACTiON  ROUTINES 

We  now  describe  the  operation  of  the  action  routines  associated  with  the  arcs  and  terminal 
states  of  the  grammar.  The  action  routines  examine  syntactic  and  semantic  features  to  deter¬ 
mine  when  use1  of  the  arc  or  terminal  state  is  to  be  allowed.  Central  to  these  routines  are  the 
information  lists  which  they  operate  upon. 

1.  Information  Lists 

As  parsing  proceeds,  the  relevant  syntactic  and  semantic  information  is  kept  in  information 
lists  which  are  attached  to  each  parse  structure  built  up.  For  terminals,  the  information  list 
consists  of  the  list  of  semantic  features  associated  with  that  word.  For  nonterminals,  the  in¬ 
formation  list  is  constructed  by  the  action  routine.,  as  described  below. 

An  information  list  has  the  form  of  a  LISP  property  list:  an  alternating  sequence  of  tags  and 
values  (TAG1  VALUE1  TAG2  VALUE2  . .  .  ).  A  value  may  itselr  be  a  property  list  in  turn. 

This  property  list  corresponds  quite  closely  to  the  set  of  registers  which  Woods*  *  uses  to  aug¬ 
ment  his  transition  net  grammar.  Therefore,  in  this  section,  the  tag-value  pairs  are  referred 
to  as  registers. 

As  parsing  takes  place,  the  action  routines  examine  the  information  in  these  registers  to 
see  if  the  use  of  in  arc  or  terminal  state  is  to  be  allowed  If  so,  they  place  updated  information 
into  the  registers  (actually  into  a  new  copy  of  th-  registers)  which  can  be  examined  in  turn  by 
later  action  routines. 

a.  Registers  Used  by  the  SENT  Net 

This  section  describes  the  registers  used  by  the  action  routines  in  the  SENT  net.  These 
registers  are: 

FEATL1ST  (syntactic  feature  list)  The  value  of  this  register  is  the  list 

of  syntactic  features,  together  with 
their  values  (PLUS  or  MINUS). 

SSUB  (surface  subject)  This  register  contains  the  semantic 

feature  information  describing  the 
surface  subject  of  the  sentence.  The 
register  lz  set  by  the  action  routine 
of  the  NP  arc  which  corresponds  to 
the  surface  subject. 
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.OBJ  (surface  object) 


BY-NP  ("by"  r.  :n  phrase, 
or  "passive  subject") 


WH  (WH  word) 


PREP  (preposition) 


PP  (prepositional  phrase) 


VERB  (verb) 


This  register  contains  the  semantic 
feature  information  describing  the 
surface  object  of  the  sentence.  The 
register  is  set  by  the  action  routine 
of  the  NP  arc  which  corresponds  to 
the  surface  object. 

This  register  contains  the  semantic 
information  describing  the  deep  struc¬ 
ture  subject  of  a  passive  sentence  (i.e., 
the  NP  "the  table"  in  "The  book  is  sup¬ 
ported  by  the  table.").  The  ’-egister  is 
set  by  the  action  routine  corresponding 
to  this  NP  in  the  net. 

In  WH-type  questions  and  relative 
clauses,  this  roister  s  set  to  contain 
the  semantic  informat  .on  describing 
the  WH  word  (i.e.,  wl.at,  who,  whom, 
which). 

In  a  prepositional  relative  clause  (PRC) 
or  a  prepositional  question  (PQ),  this 
register  contains  the  information  de¬ 
scribing  the  preposition. 

In  a  sentence  with  a  verb  requiring  a 
propositional  phrase  (such  as  "place", 
or  "put"),  this  register  contains  the 
semantic  information  describing  that 
phrase. 

This  register  contains  the  semantic 
information  describing  the  verb. 


If,  during  the  parsing  of  a  given  sentence  (or  sentence  iragment),  any  of  these  structures 
has  not  been  recognized,  then  the  associated  register  contain.'  NIL. 


2.  Two  Examples 

We  now  give  two  examples  of  how  these  registers  are  used.  Let  us  assume  that  we  are  using 
the  SENT  net  to  parse  ihe  phrase  "which  the  table  supports".  In  parsing  this  structure,  a  num¬ 
ber  of  arcs  are  traversed  in  the  SENT  net.  To  determine  that  an  arc  can  be  traversed,  the  action 
routine  associated  with  that  arc  must  be  called.  That  action  routine  accepts  as  input  two  informa¬ 
tion  lists: 

(a)  The  information  list  of  the  SENT  net  parsed  so  far. 

(b)  The  information  list  of  the  terminal  or  nonterminal  currently  being  parsed  (in  this 

example,  the  WH,  NP,  and  VERB  successively). 

The  action  routine  returns  NIL  if  the  arc  cannot  be  traversed.  Otherwise,  it  returns  a  new 
information  list  representing  the  entire  SENT  structure  parsed,  including  the  current  arc. 

Example  1 

"which"  The  action  routine  associated  with  the  WH  arc  in  the  grammar 

sets  ihe  WH  register  to  contain  the  information  of  the  word 
"which".  It  also  sets  the  FEATLIST  register  to  indicate  that 
the  structure  is  either  PRC,  ORC,  or  CP.C,  either  active  or 
passive. 

"the  table"  The  action  routine  associated  with  the  surface  subject  NP  sets 
the  SSUB  register  to  contain  the  Semantic  information  asso¬ 
ciated  with  the  NP  "the  table".  It  sets  the  FLATL'ST  register 
to  indicate  that  the  structure  is  either  PRC  >r  ORC,  and  that 
it  is  active. 
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cupports 


The  action  routine  associated  with  the  verb  arc  iu  railed,  and 
since  the  sentence  is  active,  it  checks  to  see  if  the  NP  de¬ 
scribed  by  the  SSUB  register  can  be  the  subject  of  the  vert. 

Had  the  sentence  been  passive,  it  would  have  checked  to  see 
if  the  SSL'B  NP  could  be  the  object  of  the  verb.  If  the  S3UB 
NP  had  not  been  recognized  (as  would  have  been  the  case  when 
parsing  a  sentence  fragment  starting  with  the  verb),  then  no 
check  would  have  been  made.  The  action  routine  then  sets 
the  VERB  register  to  contain  the  semantic  informat  on  describ¬ 
ing  the  verH  The  information  in  the  FEATL1ST  register  stays 
the  same. 


Example  2 

If  the  words  being  parsed  are  '*  which  the  kitchen  supports',  the  parsing  would  not  succeed, 
lince  the  semantic  checks  are  not  satisfied.  The  parsing  of  the  Wll  and  the  NP  would  be  the 
saute  as  in  the  previous  example,  but  the  action  routine  associated  with  the  verb  would  return 
NIL  after  it  finds  that  the  NP  cannot  be  the  subject  of  the  verb. 

3.  Brief  Summary  of  Action  Routines 

Each  transition  arc  action  routine  takes  as  input  iwo  lists  of  information:  one  made  up  by 
the  previous  parsing  in  the  net,  the  other  from  the  terminal  or  nonterminal  currently  being  parsed. 

The  routine  processes  these  two  lists  of  information  to  make  sure  that  the  arc  can  be  trav¬ 
ersed.  If  so,  a  new  updated  list  of  information  is  returned. 

The  initial  and  final  state  action  routines  are  similar  but  accept  only  a  single  input,  the  ):st 
of  information  made  up  by  parsing  of  previous  arcs  in  the  net,  and  determines  whether  the  struc¬ 
ture  which  has  been  parsed  can  begin  or  end  (as  appropriate)  at  that  particular  state. 

4.  Parsing  I,ocally 

Since  LPARS  uses  the  transition  net  to  parse  locally,  the  action  routines  must  be  written  so 
that  the  system  can  start  left-to-right  parsing  from  any  place  in  the  net.  It  is  not  difficult  to 
structure  the  action  routines  to  allow  this  flexibility,  but  it  does  require  some  thought. 

One  potential  problem  is  that  semantic  feature  checks  cannot  assume  that  a  given  construct 
will  always  have  been  parsed  just  because  it  is  to  the  left  of  the  current  arc  being  traversed. 

For  instance,  when  parsing  a  verb  in  a  declarative  sentence,  one  cannot  assume  that  the  surface 
subject  will  always  have  been  recognized.  As  a  result,  any  semantic  test  must  first  ascertain 
that  the  required  registers  contain  semantic  information. 

Similarly,  syntactic  feaiu’-e  checking  must  be  set  up  so  that  it  can  start  anywhere  in  the  net. 
This  is  not  difficult  to  achieve.  In  fact,  one  of  the  major  advantages  of  using  syntactic  features 
is  tnat  the  syntactic  feature  logic  consists  mainly  of  crossing  off  all  features  which  cannot  use 
an  arc  or  terminal  state,  and  thus  has  little  built-in  lcft-to-right  bias. 
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ill.  LEXICAL  RECOGNITION  AND  PHONETIC  SCRAMBLING 


This  section  describes  the  approach  to  word  recognition  taken  by  LPAKS.  Since  LPARS  is 
not  connected  to  a  working  phoneme  recognizer,  there  are  two  parts  to  this  description:  first, 
how  LPARS  generates  "scrambled  input"  which  simulates  input  obtained  from  a  phoneme  recog¬ 
nizer;  and  second,  how  the  word  recognition  algorithm  processes  this  input  to  reconstruct  werds 
which  might  be  present. 

in  designing  the  phonetic  scrambler  and  the  word  recognizer,  a  balance  has  been  attempted 
between  simplicity  on  the  one  hand,  and  acceptably  realistic  simulation  of  real  phonetic  input  on 
the  othei .  The  main  emphasis  of  the  present  work  is  on  the  use  of  higher-level  linguistic  con¬ 
straints  to  guide  lexical  analysis,  rather  than  on  developing  sophisticated  meth  of  performing 
lexical  analysis  with  a  given  front  end. 

There  are  some  quite  complex  phenomena  which  can  affect  lexical  recognition  with  a  real 
front  end.  These  phenomena  include  coarticulation;  supra- segmental  effects  involving  pitch,  in¬ 
tensity,  and  stress;  speaker-dependent  effects;  etc.  The  present  research  ignores  these  phe¬ 
nomena  for  two  reasons.  First,  any  attempt  to  deal  meaningfully  with  tnem  in  the  absence  of  a 
working  front  end  would  be  difficult.  Second,  incorporating  such  phenomena  into  th  lexical  nroc- 
essing  of  the  LPARS  system  might  cloud  the  fundamental  higher- level  issues  involved  in  LPARS 
design. 

Thus,  the  lexical  scrambling  and  recognition  scheme  used  by  the  LPARS  is  not  presented  as 
the  last  word  in  lexical  recognition  of  continuous  speech.  On  the  contrary,  it  is  presenter  as  a 
simple  scheme  which  is  easy  to  understand,  and  which  preserves  the  basic  characteristics  of 
spoken  input. 

Our  work  makes  only  one  fundamental  assumption  -  that  the  matching  of  a  word  against  a 
section  of  input  can  be  done  at  a  variety  of  levels  of  confidence.  In  .he  present  work,  phonetic 
distance  is  used  to  express  these  levels  of  confidence. 

In  LPARS,  phonetic  distances  arc  determined  from  the  confusion  statistics  used  when  the 
scrambler  introduces  error  into  the  input  it  prepares  for  LPARS.  With  a  real  front  end,  of 
course,  phonetic  distances  would  be  based  on  the  statistical  likelihood  of  the  various  errors 
possible. 

A.  PHONETIC  SCRAMBLING 

We  now  describe  SCRAMBLE,  the  phonetic  scrambler  which  produces  input  for  LPARS.  The 
scrambler  accepts  as  input  a  string  of  phonemes  correctly  spelling  an  utterance.  The  scrambler 
transforms  this  correct  string  of  phonemes  into  a  scrambled  string  containing  substantial  error. 
The  error  is  created  by  substitution  and  deletion  of  the  input  phonemes.  SCRAMBLE  does  not,  at 
present,  insert  phonemes  into  the  input. 

In  making  these  substitutions  and  deletions,  SCRAMBLE  uses  the  phoneme  similarity  table 

1  o 

found  in  Appendix  10  of  the  ARPA  Speech  Report.  This  phoneme  similarity  table  (the  "PST"), 
reproduced  as  Appendix  A  of  this  report,  ranks  each  of  36  phonemes  against  one  another  on  a 
scale  from  0  (meaning  completely  dissimilar)  to  100  (meaning  identical).  These  similarity 
measures  are  a  bit  artificial  since  they  are  based  on  a  somewhat  abstract  notion  of  phoneme 
similarity,  rather  than  on  a  rigorous  evaluation  of  a  specific  phoneme  recognizer.  The  table  is, 

g 

however,  loosely  correlated  to  the  Vicens-Reddy  front  c.id.  Although  this  table  is  used  by 
LPARS  for  its  lexical  processing,  the  approach  taken  is  quite  general  and  is  not  dependent  on 
the  particular  statistics  in  this  tabic. 
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START 


Fig.  Ill- 1 . 

Figure  III- 1  shows  in  flowchart  form  how  a  correctly  spelled  sentence  is  scrambled. 
SCRAMBLE  examines  each  phoneme  in  turn,  first  deciding  whether  or  not  to  delete  that  phoneme. 
If  not,  SCRAMBLE  then  decides  what  phoneme  to  substitute.  These  decisions  are  made  randomly, 
weighted  by  the  numbers  in  the  PST  as  described  in  Secs.  1  and  2  beiow.  The  scrambling  might 
have  been  more  complete  had  insertion  of  phonemes  been  allowed.  This  would  have  required  only 
a  simple  modification  of  the  system. 

1.  Substitution 

For  each  phoneme  that  is  not  deleted,  SCRAMBLE  must  decide  what  phoneme  is  to  be  substi¬ 
tuted  for  it.  The  following  example  describes  this  procedure. 

Let  us  assume  that  the  '•Trect  input  phoneme  is  "T".  The  PST  ranks  the  similarity  of 
all  other  phonemes  to  "T"  on  a  scale  of  0  to  100.  SCRAMBLE  must  determine  which  of  these  to 
substitute  for  the  "T".  For  each  input  phoneme  which  is  not  deleted,  the  percentage  likelihood 
that  SCRAMBLE  chooses  a  substitute  phoneme  in  a  given  similarity  range  is: 


1  percent 

between  0  and  40 

1  percent 

between  40  and  50 

Z  pci  cent 

between  50  and  60 

2  percent 

between  60  and  70 

4  percent 

between  70  and  80 

40  percent 

between  80  and  90 

50  percent 

between  90  and  100 

(if  nc  phonemes  exist  In  a  particular  similarity  range,  phoneme  from  the  next  higher  similarity 
range  is  chosen). 

This  particular  mapping  was  chosen  by  examining  tables  in  Appendix  10  of  the  ARPA  Re¬ 
port,19  and  trying  to  correlate  roughly  the  PST  to  the  performance  characteristics  of  the  Vicens 
phoneme  recognizer. 

The  actual  percentages  used  can  be  varied  to  test  the  system  at  various  levels  of  input  error. 
The  values  given  here  are  those  used  in  the  system  evaluation  described  in  Sec.  Vll.  Figure  I1I-2 


90  to  100:  P,T,D,CH,K,F,TH 

80  to  90:  B,J,G,V,DH 

70  to  80:  S,H,SW 

60  to  70:  Z,SH.ZH,M,N,NG,Y,I 

50  to  60:  W,R,L,U 

40  to  50:  A(E 

0  to  40:  OO  0,AW, AA,AR, AE 
Fig.  111-2. 

indicates  which  phonemes  fall  into  each  of  the  different  similnritv  ranges  with  respect  to  "T" 

(A  list  of  the  phonemes  used,  together  with  pronunciation,  is  given  in  Appendix  A.  The  phoneinie 
representation  used  here  is  that  of  the  A11PA  Report.  Most  phonemes  are  fairly  obvious,  except 
possibly  "SW"  which  is  the  neutral  vowel  "a".  ) 

The  result  of  this  scrambling  is  that,  10  percent  of  the  time,  "T"  is  scrambled  quite  badlv 
(into  a  vocalic  phoneme  or  a  fricative).  The  rest  of  the  time  the  substitution  is  made  almost  at 
random  from  a  number  of  stop-like  phonemes.  Thus,  the  likelihood  of  preserving  the  "T"  is 
quite  low. 

Other  phoremes  (especially  vowels)  tend  to  have  fewer  phonemes  ranked  as  being  very  simi¬ 
lar  in  the  PST.  For  these  phonemes,  the  likelihood  of  preserving  the  original  phoneme  is  much 
greater.  Thus,  SCRAMBLE  simulates  a  front  end  which  recognizes  vowels  much  better  than 
specific  consonants.  This  is  consistent  with  the  Vieens-Heddv  front  end,  on  which  the  PST  is 
loosely  based. 

As  discussed  in  Sec.  Ill—  H,  the  phonetic  distances  used  by  LPARS  are  set  up  to  reflect  these 
scrambling  statistics.  As  a  result,  any  two  phonemes  with  similarity  gr>-««n,.  Ih.nn  90  are  at  a 
phonetic  distance  of  zero  from  one  another,  since  at  best  substitutions  are  chosen  at  random 
from  the  group  of  phonemes  with  similarity  90  or  above. 

Therefore,  although  SCRAMBLE  chooses  a  single  phoneme  to  replace  each  input  phoneme, 
this  phoneme  really  represents  a  set  of  phonemes  within  zero  phonetic  distance  of  that  phoneme 
(above  90  similarity  in  the  PST).  Thus,  the  scrambler  simulates  a  front  end  whieh  often  cannot 
uniquely  identify  a  phoneme,  but  can  only  roughly  classify  it.  The  Vicens- Reddy  front  end,  for 
instance,  recognizes  a  stop-like  segment  as  a  stop  without  trying  to  identiry  which  stop  it 
might  be. 

As  a  result,  the  string  of  individual  phonemes  produced  by  SCRAMBLE  is  perhaps  hest  en¬ 
visioned  as  a  string  of  possible-p  lonere  lists,  each  containing  several  phonemes.  Thus,  LPmRS 
input  contains  implicit  amhig1'  as  well  as  error. 

2.  Deletions 

SCRAMBLE  decides  whether  to  delete  a  phoneme  randomly,  weighted  by  its  similarity  to 

both  of  its  neighbors  A  phoneme  bordered  by  similar  phonemes  is  more  likely  to  be  deleted 

than  one  bordered  bv  dissimilar  phonemes.  This  approach,  suggested  in  Appendix  10  of  the 
1 9 

ARPA  Report,  is  a  reasonable  approximation  of  front-end  behavior  since,  for  instance,  one 
would  expect  a  vowel  to  be  more  likely  deleted  when  surrounded  by  vocalic  segments,  and  a  con¬ 
sonant  to  be  more  likely  deleted  when  part  of  a  consonant  cluster. 
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SCRAMBLE  deletes  about  15  percent  of  the  phonemes  it  processes.  It  deletes,  at  most, 
two  phonemes  in  a  row.  The  probability  of  deleting  a  phoneme  immediately  after  a  previous  de¬ 
letion  is  half  that  of  the  normal  probability. 

3.  Example 

We  now  present  an  example  of  phonetic  scrambling  to  help  give  a  more  concrete  feeling  for 
LPARS1  scrambled  input.  First,  a  string  of  27  T's  is  scramb  d: 

Input 

(TTTTTTTTTTTTTTTTTTTTTTTTTTT) 

Scrambled  String 

<TH  DTTTHBKFSWCHBBPFFT) 

This  gives  an  idea  of  the  type  ol  substitutions  made.  The  number  of  deletions  made  here, 
however,  is  extremely  high,  because  the  likelihood  of  deletion  is  affected  by  the  similarity  of 
adjacent  phonemes. 

Next,  similar  scrambling  is  performed  with  a  string  of  S's  and  A's; 

Input 

(SSSSSSSSSSSSSSSSSSSSSSSSSSS) 

Scrambled  String 

(EE  SSZHHZHSZIIHSHSZZHZSHII) 

Input 

(AAAAAAAAAAAAAAAAAAAAAAAAAAA) 

Scrambled  String 

(AE  A  AR  AE  O  A  A  A  A  A  AE  O  A  TH  A  AE  A  O) 

Notice  that  the  string  of  A's  (vowels)  is  not  as  badly  scrambled  as  the  T's  or  S's. 

In  the  next  example,  the  string  "S  A  T  S  A  T  .  . .  "  is  scrambled. 

Input 

(SATSATSATSATSATSATSATSATSAT) 

Scrambled  String 

(H  A  SW  H  O  TJ!  !!  A  DH  H  A  S  AE  P  H  A  K  H  AR  D  S  E  A) 

Here  there  is  much  less  deletion,  since  adjacent  phonemes  are  quite  dissimilar. 

Finally,  the  sentence  "The  green  coffeetable  supports  the  dictionary"  is  scrambled-  Its 
correct  spelling  is: 

(TH  SW  G  R  EE  N  K  AW  F  EE  T  AA  B  L  S  SW  PORTS 
TH  SW  D  I  K  SH  SW  N  AA  R  EE) 


JO 


Two  examples  of  this  sentence,  after  scrambling,  are: 

(IDY1NGDHAAKESWAAKYSHSWBOWFH 

KSWGKAASWAAR1) 

(SW  G  L  W  DH  AA  EE  H  AA  PUBOKLEZ  DH  DH  1 
P  H  SW  M  L  EE) 

Offhand,  it  might  seem  unlikely  that  the  original  words  are  retrievable  from  this  scrambled 
hodge-podge.  It  is  difficult  for  a  person  to  even  match  up  parts  of  the  scrambled  sentence  to  the 
original  The  computer,  however,  can  do  a  much  more  thorough,  patient,  systematic  job  of  lexi¬ 
cal  matching  than  a  person  can.  In  fact,  the  scrambled  sentence  is  somewhat  misleading  to  a 
human  because  it  is  overspecified,  since  each  phoneme  really  represents  a  set  of  phonemes 
which  are  equivalent  under  the  scrambling. 

B.  MEASURING  PHONETIC  DISTANCE 

In  LPARS,  phonetic  distances  of  individual  phoneme  substitutions  and  deletions  are  added  to¬ 
gether  to  determine  the  overall  phonetic  distance  of  a  word. 

Since  the  phonetic  distances  are  added  together,  they  must  be  logarithmically  related  to  the 
probabilities  of  substitution  and  deletion.  Thus,  if  one  word  matches  an  input  segment  but  for 
two  errors  of  1/3  likelihood  each,  and  another  word  of  equal  length  matches  but  for  a  single  error 
of  1/9  likelihood  (a  more  severe  error),  the  likelihood  of  each  word  being  present  is  equal  (since 
the  likelihood  of  two  errors  of  l/3  probability  occurring  is  1/3  *  1/3  =  l/9).  Therefore,  their 
total  phonetic  distances  must  be  equal.  This  will  only  be  true,  in  general,  if  phonetic  distances 
are  logarithmically  related  to  the  error  probabilities. 

The  actual  scale  used  to  measure  the  distances  is  not  important.  LPARS  measures  phonetic 
distance  on  a  scale  from  0  to  120.  Phonemes  with  similarity  scores  of  90  to  100  are  ol'  distance 
0  from  one  another.  Phonemes  with  similarity  scores  of  0  to  40  are  of  distance  120  from  one 
another,  representing  the  probability  of  1  percent. 

Figure  111-3  snows  graphically  how  error  probability  and  phonetic  distance  are  related  in 
LPARS.  With  a  real  front  end,  phonetic  distances  would  be  based  on  the  statistical  probability 
of  the  various  possible  errors.  The  phonetic  distance  associated  with  beth  substitutions  and  de¬ 
letions  corresponds  to  the  probability  o'  that  error  occurring  in  the  scrambler. 


0  10  20  50  40  50 


PROBABILITY  OF  £RR0R 

Fig.  Ill- 3. 
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C.  WORD  RECOGNITION  ALGORITHM 


This  section  describes  the  use  of  phonetic  distance  to  measure  how  accurately  the  phonetic 
spelling  of  a  word  matches  a  particular  section  of  input. 

Figure  III- 4  shows  a  simple  example  of  how  the  matching  is  done.  The  dictionary  spelling 
of  the  noun  "dictionary"  is  compared  with  three  sample  sections  of  input.  (At  present,  the  sys¬ 
tem  does  not  consider  multiple  pronunciations  of  a  word.)  In  LPARS,  the  input  quality  is  much 
wot  se  than  these  samples  might  seem  to  imply.  This  example  is  kept  simple  for  clarity;  also, 
the  phonetic  distances  have  been  chosen  arbitrarily. 


Spelling 

D  I  K  SH  SW  N  AE  R  EE 


Tote!  Distance 

Distance  per  Phoneme 


Input  Samples 


F  1 
40 

K 

SH 

SW 

N 

AE 

R 

EE 

40 

D  1 

K 

50 

SW 

N 

AE 

R 

EE 

50 

F  1 
40 

K 

50 

SW 

N 

AE 

R 

EE 

90 

4 

5 
10 


Fig.  £11-4. 

In  matching  a  word  against  a  section  of  input,  the  phonetic  distances  of  individual  substitu¬ 
tions  or  deletions  are  added  together  yielding  a  total  distance  for  the  entire  word.  The  algorithm 
searches  for  that  combination  of  substitutions  and  deletions  that  yields  the  lowest  total  distance. 

LPARS  always  performs  its  lexical  matching  using  a  threshold  maximum  distance-per- 
phoneme.  During  the  various  levels  of  LPARS  analysis,  the  threshold  gets  progressively  higher 
and  higher  as  more-and-more  context  is  used  to  indicate  where  a  particular  word  might  be  pres¬ 
ent.  Only  words  which  match  within  the  threshold  are  recognized.  Thus,  if  this  threshold 
distance- per- phoneme  is  10,  no  matches  of  the  word  "dictionary"  are  accepted  with  total  dis¬ 
tance  greater  than  90  (10  *  the  word-length). 

A  very  simple  algorithm  is  used  to  compute  the  lowest- distance  match  of  a  word  to  an  input 
segment.  The  algori  hm  performs  a  Ieft-to-right  comparison  of  the  dictionary  spelling  of  a  word 
against  a  sectio..  of  ;nput,  trying  all  possibilities  of  substitution  and  deletion.  This  process  is 
essentially  a  tree  search,  of  a  substitution-deletion  tree,  to  find  that  branch  of  the  tree  with  the 
lowest  score.  If  the  score  of  any  partial  branch  being  explored  exceeds  the  threshold  distance, 
the  investigation  of  that  branch  is  aborted.  If  all  branches  are  aborted,  then  the  word  does  not 
match  the  input  segment  within  the  threshold  distance.  Appendix  C  gives  a  flowchart  of  this 
algorithm. 

Although  the  algorithm  is  very  simple,  it  does  generate  a  great  deal  of  computation.  In  the 
initial  scar1  of  the  utterance,  for  instance,  virtually  every  word  in  the  vocabulary  is  matched 
against  the  entire  input,  i.e.,  starting  at  every  possible  phoneme  position.  This  initial  scan,  of 
course,  is  do*".e  at  a  fairly  low  phonetic  distance,  which  limits  somewhat  '.he  amount  of  computa¬ 
tion  required. 
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D.  SOME  COMMENTS 


We  now  make  two  comments  on  the  approach  taken  Were  toward  lexical  recognition. 

1.  Local  Exponential  Explosion  vs  Vertical  Organization 

The  lexical  recognition  algorithm  clearly  has  the  potential  for  exponential  explosion  of  com¬ 
putation  as  the  threshold  is  raised  higher  and  higher,  and  a  deeper- and- deeper  search  is  needed 
of  the  substitution- deletion  tree  when  matching  words.  On  the  other  hand,  the  initial  scan  is 
made  at  a  fairly  low- distance  threshold  (looking  for  words  that  are  not  too  garbled),  and  the 
amount  of  work  required  to  match  words  at  a  low  phonetic  distance  is  less  than  the  amount  re¬ 
quired  for  a  high  threshold. 

One  of  the  advantages  of  the  vertical  organization  of  LPARS  is  that  most  of  the  local  lexical 
matching  is  done  at  a  fairly  low  threshold,  and  higher- threshold  scans  need  be  done  only  very 
selectively,  in  specific  places,  looking  for  specific  subsets  of  the  total  vocabulary. 

2.  Extendability  of  This  Lexical  Approach 

The  lexical  recognition  algorithm  is  not  presented  as  the  last  word  in  lexical  matching;  it  is 
a  deliberately  simple  scheme.  With  a  real  front  end,  there  are  many  refinements  that  could  make 
the  matching  more  accurate.  The  nature  of  these  refinements,  however,  is  not  well  understood 
and  constitutes  a  research  problem  in  its  own  right.  In  this  report,  the  main  emphasis  is  on  the 
use  of  higher- level  constraints  to  help  guide  local  processing.  Even  with  mor*  «~efined  lexical 
matching  techniques,  the  basic  concept  of  initially  looking  for  words  thui  match  well,  and  later 
selectively  searching  for  words  that  match  les:1  well,  would  be  equally  valid 
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IV.  INITIAL  STAGES:  LOCAL  WORD  RECOGNITION 


The  first  two  stages  in  LPAHS'  processing  of  an  utterance  are  now  described.  First,  the 
initial  low  phonetic  distance  scan  through  the  entire  utterance  looking  for  long  words  (content 
words)  that  are  not  too  garbled;  and  second,  the  higher-distance  matching  in  the  areas  adjacent 
to  and  between  the  word  candidates  found  by  this  initial  scan,  looking  for  strings  of  sir  11  func¬ 
tion  words,  and  lor  long  word3  which  are  more  garbled. 

Du.ung  these  two  stages,  no  attempt  is  mad.,  to  build  any  syntactic  structure.  Only  very  lo- 
<.  lex1  ’  ana  boundary  criteria  are  used  in  the  higher-distance  matching  described  here.  The 
final  o\  ‘  of  this  processing  i.«  a  set  of  word  candidates  recognized  ir.  all  parts  of  the  sentence. 
Many  u.  words  may  be  wrong,  ma.iv  may  overlap.  Also,  there  may  be  gaps  in  the  utterance 

which  are  covered  by  any  word  candidate.  The  goal  of  this  initial  matching  is  to  uncover  most 

of  the  words  in  the  sentence.  These  words  are  then  turned  over  to  higher-level  parts  of  LPARS 
which  continue  the  analysis  in  a  more  comprehensive,  systematic  fashion. 


input 


INITIAl  ! 

word 

HIGHER-LEVEL 

SCAN  ! 

i 

^  candidates 

PART  OF  LPARS 

LOCAL 
HIGHER-DISTANCE 
MATCHING 


I-  iff.  1V-1. 


Figure  1V-1  shows  how  these  two  stages  fit  into  the  overall  LPARS  framework. 

A.  INITIAL  SCAN 

When  processing  an  utterance,  LPARS  first  makes  a  scan  through  the  entire  utt  ranee  at  a 
fairly  low  phonetic  distance.  This  scan  searches  only  Tor  content  '<•  ,  (which  in  LPARS'  vocab¬ 

ulary  are  all  at  least  4  phonemes  in  length). 

The  scan  is  maoe  by  matching  the  spelling  of  each  word  against  the  input,  starting  at  every 
possible  phoneme  position  in  the  utterance.  The  scan  returns  a  set  of  word  candidates,  each  of 
which  consists  of  a  proposed  word,  its  starting  and  ending  phonemes,  and  its  total  phonetic 
distance. 

The  following  example  shows  the  result?  of  scanning  a  scrambled  sentence  in  this  manner, 
at  a  number  of  different  phonetic  distances. 

1 .  Exam  pie 

""he  sentence  being  scanned  in  this  example  is  "The  green  coffeetable  supports  the  ashtray". 
Th.s  sentence  is  scrambled,  and  then  scanned  at  a  number  of  phonetic  distances.  (The  LPARS 
Vocabulary  is  given  in  Appendix  H.  All  content  words  are  at  'oast  4  phonemes  in  length.  The 
only  unusual  feature  of  the  vocahulary  is  that  some  word  strings  such  as  "on  tr.p  of"  and  "on  the 
left  of'  are  treated  as  single-word  prepositions.  ) 


34 


c  \N  1  riuiietic  Distance  -  15,  Minimum  Word  Length  -  4 


((CLUTTER  1  4  60)  (COFKELTABLE  7  14  90)) 

This  returns  a  list  of  word  candidates  of  the  following  form 

(WORD  START-PHONEME  END-PHONEME  TOTAL- DISTANCE ) 

The  word  "coffeetable",  for  instance,  has  been  found  between  phonemes  7  and  14  at  a  total 
distance  of  90.  Sit.ce  the  threshold  distance-per-phoneme  of  :he  scan  i3  15,  no  word  is 
found  at  a  total  clionetic  distance  greater  than  15  *  its  wordlength. 

SCAN  2-  Phonetic  Distance  =  20,  Minimum  W  rd  Ltngtii  =  4 

((CLUTTER  1  4  60)  (COFFEETABLE  7  14  90)  (CLUTTER  li  17  85) 

(ROBERT  14  18  1 05 )  (CLUTTER  16  19  95)  (CONTAIN  20  25  115) 

(DICTIONARY  21  28  145)) 

SCAN  3:  Phonetic  Distance  -  30.  Minimum  Word  Length  -  4 

((ROBERT  1  ’  460)  (CLUTTER  1  4  60)  (PERSON  1  5  170) 

(FRIENDLY  1  5  155)  (GREEN  3  5  1 00)  (COFFEETARLE  7  14  90) 

(CONTAIN  9  13  170)  (CABINET  8  15  200)  (INFRONTOF  10  18  250) 
(TABLELAMP  11  16  21 0)  (ONTHELEFTOF  12  18  280) 

(CLUTTER  13  17  85)  (ROBERT  14  18  105)  (INFRONTOF  14  22  270) 
(ENTRANCF  14  19  175)  (CHRISTOPHER  16  22  205) 

(CLUTTER  16  19  95)  (PI.ACE  16  19  1 1 5 )  (FOOTSTOOL  16  22  195) 
(TELEVISION  16  22  195)  (INBACKOi  17  22  210)  (NEXTTO  17  21  175) 
(CABINET  18  23  200)  (PERSON  18  22  170)  (CONTAIN  20  25  115) 

(GREEN  20  22  95)  (DICTIONARY  21  28  145)) 

Notice  that,  as  the  phonetic  distance  threshold  increases,  the  number  of  words  found  in¬ 
creases  dramatically. 

SCAN  4:  Phonetic  Distance  25,  Minimum  Word  Length  2 

((CLUTTER  1  4  60)  (FRIENDLY  1  5  155)  (THE  1  2  0) 

(THE  2  2  15)  iGREKN  3  6  100)  (THE  3  4  5)  (BY  3  5  75) 

(THE  4  4  20)  (COFFEETABLE  7  14  90)  (IN  10  11  45) 

(ON  12  13  40)  (CLUTTER  15  17  65)  (ROBERT  14  18  105) 

(WHICH  14  16  75)  (ENTRANCE  14  19  175)  (CLUTTER  16  19  9r) 

(TELEVISION  16  22  195)  (THE  16  17  45)  (CONTAIN  20  25  1  15) 

(GREEN  20  22  95)  (THE  20  21  5)  (DICTIONARY  21  28  145) 

(THE  21  21  20)  (THE  23  24  40)  (ON  26  27  40,  (WHICH  27  29  10) 

(WHAT  27  2  751  (WHICH  28  29  25)) 
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This  scan  is  made  looking  for  words  of  length  2  or  greater,  and  illustrates  why  it  makes 
sense  to  oirk  up  small  function  words  using  context,  rather  than  scanning  for  them  in  the 
initial  scan,  A  large  number  of  such  words  is  found,  most  of  them  wrong. 

2.  Two  Possible  Extensions 

a.  Using  Inter-utterance  context 

One  interesting  extension  of  the  initial  scan  described  above  is  to  use  semantic  constraints 
of  c  global  nature  to  restrict  the  worcs  initially  searched  for.  This  approach  is  very  much  in 
i  eeping  with  the  vertical  design  of  LPARS. 

In  an  interactive  speech  system,  if  the  previous  context  of  the  dialog  can  substantially  re¬ 
strict  the  words  that  are  likely  to  be  uttered  at  particular  times,  tnen  only  those  words  need  be 
included  in  the  initial  scan.  Only  if  the  results  of  this  scan  proved  unsuccessful,  need  one  per¬ 
form  the  initial  scan  using  a  larger  part  of  the  vocabulary. 

This  extension  would  use  inter-utterance  contextual  information  in  a  vertical  fashion  to  re¬ 
duce  the  average  amount  of  computation  required  p“r  utterance.  LPARS  presently  does  not  use 
conte..t  of  this  sort.  Designing  a  system  which  uses  inter-utterance  context  in  an  interesting, 
meaningful  fashion  is  an  extensive  project  in  its  own  right. 

b.  Using  Supra-segmental  Information 

One  of  the  inefficiencies  inherent  in  LPARS’  design  is  the  necessity  of  matching  every  word 
at  every  phoneme  position  in  the  utterance.  There  is,  however,  a  great  deal  of  supra-segmental 
information  available  (see  Refs.  20  and  21)  in  the  waveform  which  might  help  indicate  where 
stressed  syllables  are,  where  phrase  boundaries  are,  etc.  Such  information  could  be  very  use¬ 
ful  in  helping  a  locally  organized  system  like  LPARS  restrict  the  amount  of  lexical  processing 
dene  in  the  initial  scan. 

B.  LOCAL  HIGHER- DISTANCE  MATCHING 

The  words  recognized  in  the  initial  scan  are  turned  over  to  LOCALMATCH,  a  routine  that 
directs  higher-distance  matching  in  areas  adjacent  to  and  between  these  words.  The  set  of  local 
scans  is  described  in  Appendix  E. 

Although  the  LOCALP’ATCH  routine  ould  have  been  written  to  operate  automatically  from 
an  arhitr"'"'  transition  net,  it  was  written  procedurally  instead  from  an  inspection  of  the  particu¬ 
lar  gx  used  by  LPARS.  Section  rV-C  discusses  some  possible  advantages  of  allowing  a 

speech-system,  designer  tne  flexibility  of  pro 'edurally  expressing  the  logic  which  directs  such 
local  word  searching. 

Tlie  type  of  logic  performed  by  LOCALMATCH  is  quite  simple,  due  in  part  to  the  relative 
simplicity  of  LPARS’  syntax  and  semantics.  Typical  actions  (all  performed  at  a  higher  phonetic 
distance  than  the  initial  scan)  include- 

(1 )  Matching  for  semantically  acceptable  adjectives  to  the  left  of  nouns  (and  vice  versa: 
matching  for  semantically  acceptable  nouns  to  the  right  of  adjectives). 

(2)  Matching  for  verbs  to  the  right  of  nouns  (and  vice  versa). 

(3)  Matching  for  prepos!tions  to  the  right  of  verbs  and  nouns,  and  for  determi’iors  fol¬ 
lowed  by  adjectives  or  nouns  to  the  right  of  verbs. 
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(4)  Mashing  for  small  function-word  strings  such  as  "in  the",  "on  the",  "which  the", 
"in  which  the",  "wltothe",  etc.,  as  appropriate,  between  two  nouns  or  a  noun  and 
an  adjective  which  are  separated  by  a  small  number  of  phonemes. 

(5)  Matching  for  function-word  strings  such  as  "the",  "it.  the",  "what  is  the",  etc. 
between  the  beginning  of  the  sentence  and  a  noun  or  adjective  which  starts  close  to 
the  beginning  of  the  sentence. 

The  LOCALMATCH  routine  takes  advantage  of  the  word  boundary  information  obtained  from 
the  initial  scan,  together  with  local  syntactic  constraints  of  the  grammar.  Thi3  information  is 
used  to  direct  searches  for  small  function-word  strings  that  might  be  difficult  to  find  without 
context,  and  to  find  more  garbled  longer  words  next  to  words  already  found. 

1.  Example 

The  following  is  an  exai  of  an  initial  scan  and  subsequent  high-distance  matching.  The 
sentence  being  processed  is  i  he  wastebasket  which  the  green  coffeetable  supports  contains  the 
dictionary". 

Initial  Scan  Output 

((STEPLADDER  8  15  150)  (COFFEETABLE  18  25  15)  (SUPPORT  26  31  5) 

(CONTAIN  33  38  1 0)  (PICTIONARY  42  49  95)) 

LOCALMATCH  Output 

((CONTAIN  1  5  135)  (THE  1  2  0)  (PLACE  3  5  00) 

(WASTEBASKET  3  10  185)  (THE  6  7  0)  (STEPLADDER  S  15  150) 

(WHICH  11  12  12)  (CLUTTER  11  14  100)  (THE  13  14  12) 

(THE  15  17  0)  (GREEN  15  17  60)  (COFFEETABLE  18  25  15) 

(SUPPORT  Z..  31  5)  (CLUTTER  30  32  110)  (ROBERT  30  33  120) 

(THE  32  33  0)  (THE  33  j3  0)  (CONTAIN  33  38  1C) 

(BOOKCASE  34  38  HO)  (ENJOY  34  38  120)  (THE  39  41  0) 

(PERSON  42  46  105)  (DICTIONARY  42  49  95)) 

2.  Semantics  in  the  Higher- Distance  Matching 

The  LOCALMATCH  routine  does  not  use  semantics  very  extensively  to  help  it  restrict  the 
various  words  that  it  matches.  However,  there  are  a  few  instances  in  which  semantics  are  used, 
such  as  when  it  is  looking  for  adjectives  in  front  of  a  noun,  it  need  only  look  for  adjectives  that 
are  semantically  compatible  with  that  noun. 

In  general,  however,  these  scans  are  based  on  such  local  information  *hat  it  is  impossible 
to  incorporate  too  many  semantic  restrictions.  For  instance,  in  scanning  ior  a  verb  next  to  a 
noun,  one  cannot  require  that  the  verb  be  able  to  take  the  noun  as  a  subject,  for  inctance,  since 
the  noun  might  be  nested  in  prepositional  phrases  or  clauses,  modifying  the  subject. 

As  a  result,  the  information  being  exploited  by  the  LOCALMATCH  routine  is  primarily  the 
word  boundary  information  together  with  some  very  low-level  syntactic  constraints,  rather  than 
higher-level  syntactic  or  semantic  information. 
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If,  however,  the  semantic  component  of  LPARS  was  extended  to  include  a  scene  with  objects 

4 

being  talked  about  (as  in  Winograd's  block  world  ),  and  to  include  an  interacthe  dialog  with  pre¬ 
vious  events  which  could  be  referred  to  (in  modifying  clauses),  the  semantic  pa»t  of  this  lexical 
matching  could  be  made  much  more  useful  and  interesting  as  discussed  below. 

C.  SOME  SPECULATION 

We  now  discuss,  in  a  somewhat  speculative  vein,  some  reasons  why  the  1  OC'Al  MATCH 
routine  in  a  system  like  LPARS  might  be  more  conveniently  expressed  proccdurally,  rather  than 
written  to  work  automatically,  given  any  transition  net  grammar.  These  argume  fs  would  be 
particularly  valid  in  a  system  with  a  much  more  complex  semantic  component  which  made  use 
of  a  scene  being  talked  about,  as  mentioned  above. 

The  crucial  point  is  that  the  goal  of  the  LOCALMATC11  routine  is  riot  to  systematically  ex¬ 
plore  the  utterance  in  a  globally  consistent  fashion.  Rather,  the  goal  is  to  use  locally  available 
words,  together  with  a  knowledge  of  objects  and  events  being  talked  about,  to  try  to  identify 
fairly  local  structures  in  the  utterance  describing  these-  ohjects  and  events. 

The  semantic  gic  that  might  be  useful  in  i,uch  a  routine  is  considerably  different  from  the 
syntactic  constraints  built  i  nto  a  grammar.  If  so,  it  may  not  he  necessary  (or  convenient)  to  in¬ 
corporate  the  semantic  logic,  which  postulates  and  searches  for  loral  structures  in  an  utterance, 
wholely  within  the  syntactic  parsing  framework.  We  do  not  say  that  it  would  be  impossible  to  do 
so,  just  that  it  might  introduce  a  level  of  formalism  (transition  net)  which  is  largely  unnecessary 
for  some  types  of  semantic  logic  and  would  therefore  be  an  inconvenient  mode  for  expressing 
that  logic. 

1.  Example  1:  Recognizing  Phrases  and  Clauses 

Suppose  a  routine  that  would  help  search  for  noun  phrases  is  wanted.  Such  a  routine  might 
be  called  whenever  the  'cxical  analyzer  discovered  an  aojectjve  or  noun.  The  routine  could  use 
knowledge  about  which  adjectives  and  nouns  could  modify  one  another  in  *!v  task  domain  to  drive 
further  lexical  matching  to  try  to  fill  out  -n  entire  noun  phrase. 

This  routine  might  also  query  the  particular  scene  being  discussed  to  see  what  objects  are 
possihly  heing  described.  The  routine  might  search  the  scene,  obtain  a  list  of  possible  ohjects, 
and  if  it  were  reasonable  to  assume  that  a  determined  noun  group  was  heing  processed,  it  could 
diced  scans  for  additional  words  that  might  also  describe  the  object  or  that  might  help  uniquely 
specify  which  ohjcct  was  being  talked  about. 

lxigic  of  this  sort  could  presumably  use  the  transition  network  grammar  to  direct  these 
scans.  On  the  other  hand,  these  scans  have  such  a  .ocai,  specific  character  that  the  transition 
net  would  he  hasically  superfluous. 

The  reason  for  this  is  twofold 

(a)  A  noun  phrase  is  a  very  locai  structure  and  appears  unchanged  in  many  global  contexts. 

(b)  The  logic  being  used  to  iircei  the  scans  is  highly  specific;  it  applies  only  to  nt  un 
phrases  and  does  not  generalize  to  cover  many  different  parts  of  speech.  On  the 
other  hand,  an  advantage  of  a  syntactic  formalism  like  the  transition  net  grammar 
is  that  it  treats  all  words  in  a  systematic,  homogeneous  fashion.  This  is  not  true 
of  the  logic  just  described.  One  invokes  the  noun-phrase  recognizer  only  when  one 
has  found  a  particular  class  of  word.  One  knows  exactly  what  to  do  next.  There  is 
no  need,  for  instance,  having  found  a  noun,  to  use  the  transition  net  to  discover  that 
an  adjective  can  modify  it. 
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A  similar  argument  would  liold  for  prepositional  phrases.  Even  when  one  considers  more 
complex  syntactic  structures  such  as  modifying  clauses,  the  arguments  art  probably  still  quite 
valid.  For  instance,  if  the  system  has  stored  a  number  of  events  (i.e.,  X  did  Y,  X  did  Y  to  Z) 
it  might  be  desirable,  having  found  descriptions  of  X  and  Y,  to  direct  scans  to  see  if  somehow 
these  structures  could  be  incorporated  into  a  relative  clause  describing  the  event  stored.  Here 
again,  one  is  looking  for  specific  things,  and  it  might  well  be  easier  io  express  this  type  of  logic 
procedurally,  divorced  from  the  formalism  of  the  transition  network  grammar. 

2.  Example  Searching  for  Small-Word  Strings 

Especially  with  tne  cr'jdity  of  front-end  output,  small  words  pose  a  very  real  problem  to  a 
speech  r  ;cognizer,  especially  when  a  number  of  them  can  occur  together. 

As  an  example,  consider  the  string  of  words  "is  in  the".  If  twi  nouns  (Ml  and  N2)  have  been 
discovered,  how  is  one  to  test  the  possibility  of  the  string  of  small  words  being  present  between 
them'5  Using  the  transition  net  grammar,  one  would  ask:  "Can  Ni  be  followed  bv  a  small  verb, 
which  can  be  followed  by  a  small  preposition,  which  can  be  followed  by  a  small  determiner, 
which  can  be  followed  by  N2?"  This  involves  a  great  deal  of  threading  one  way  through  the 
grammar,  with  very  little  lexical  information  to  base  any  parsing  choices  upon. 

It  would  be  more  convenient  to  write  a  program  which  scanned  for  the  word-string  "is  in 
the"  as  a  unit,  whenever  it  observed  that  two  nouns,  N’l  and  N2,  are  x  *  y  phonemes  apart  and 
that  N1  can  be  "in"  N2.  The  answer  to  this  question  is,  of  course,  not  obvious  and  is  probably 
dependent  on  the  particular  vocabulary  and  svniu  '  used,  on  the  application,  and  on  the  accuracy 
of  the  front  end. 

3.  What  Use  Then  Is  the  Syntactic  Formalism':’ 

Sections  1  and  2  above  offered  some  arguments  as  to  why  fairly  local  matching  might  best 
be  done  outside  the  transition  network  formalism.  If  these  somewhat  speculative  arguments  do 
make  sense,  however,  then  it  is  reasonable  to  ask  what  such  syntactic  formalism  is  good  for,  if 
anything.  The  answer  is  that  syntactic  formalism  is  needed  to  assure  global  consistency  of  lo¬ 
cal  structures  found  in  the  manner  described  above. 

Since  the  logic  described  is  based  on  fairly  local  criteria  and  is  basically  a  set  of  probabilis  ¬ 
tic  heuristics  which  looks  for  reasonable  local  constructs.  Lhere  is  no  assurance  that  recognitic” 
going  on  in  one  part  of  the  sentence  is  globally  consistent  with  recognition  elsewhere.  The  tran¬ 
sition  network  can  be  used  to  assure  that  the  entire  str’  ture  is  globally  consistent,  and  to  direct 
attention  to  possible  problem  areas  (bringing  information  from  all  parts  of  the  utterance)  if  a 
well-defined  sentence  is  not  initially  discovered.  The  primary  focus  uf  the  present  work  is,  in 
fact,  on  the  problem  of  how  to  organize  a  system  which  uses  syntax  (expressed  in  the  form  of  a 
transition  network)  (n  this  fashion. 
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\  \  l.(K  \!  APPROACH  TO  PAHANG 


a.  i.vrnoDt  c  riox 

This  sci  in  describes  the  two  local  parsing  algorithms  used  by  I, PAHS;  TREK,  which 
huilds  the  local  partial  parse  trees;  and  CONNECT,  which  connects  pairs  of  such  trees  together 
liv  proposing  words  that  might  exist  between  them.  The  description  of  these  two  algorithms  has 
be  n  kept  at  the  level  of  a  verbal  o-  review  in  this  chapter.  More-detailed  flowchart  descriptions 
are  given  in  Appendix  I). 

Both  algorithms  described  here  involve  only  left-to-right  processing  of  the  grammar,  even 
though  thev  are  used  as  part  of  a  locally  organized  parsing  system. 

This  is  the  ntost  technical  section  of  the  report  A  reader  interested  in  acquiring  a  back¬ 
ground  of  this  material  without  tor  before)  going  into  more  detail  is  advised  to  read  Secs  B-l, 
C-1,  C-d,  I)-1,  and  D-l-a  which  give  an  overview  of  the  two  algorithms  described. 

We  first  describe  the  structure  of  the  partial  parse  trees  which  form  a  central  part  of 
l.PARS  processing 


B.  REPRESENTATION  OT  PARTIAL  PARSE  TREES  <  PPTs) 


This  section  describes  how  I. PARS  represents  syntactic  structures.  It  is  assumed  that  the 
reader  has  some  general  exposure  to  syntactic  representation. 

A  PPT  is  composed  of  termir.  ar.d  nonterminal  nodes.  Each  terminal  node  corresponds 
to  a  terminal  arc  in  the  transition  net,  r.nd  each  nonterminal  node  corresponds  to  a  nonterminal 
arc.  Each  nonterminal  node  must  have  one  or  more  sons.  The  sequence  of  these  sons  corre¬ 
sponds  to  some  permitted  path  through  the  no. work  which  defines  that  nonterminal  4  number 
of  simple  PPTs  are  shown  helow  (Fig.  V-l). 
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Fig.  V-l. 

A  PPT  is  const  -ucted  out  of  words  that  are  adjac-nt  in  the  input.  It  can  span  no  gaps.  It  is 
"partial"  in  the  sense  tha*  the  outer  end  of  a  path  of  arcs  taken  through  an  outer  nonterminal  node 
need  not  terminate  on  a  permitted  state. 
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Two  points  arc  central  to  the  use  of  PPTs  in  l.PARS: 

11)  Kach  PPT  is  vert  closely  tied  to  the  grammar. 

(Z)  "Ancestor  links"  can  be  used  to  represent  the  local  ambiguity  inherent  in  ght 
rec  ursion. 

These  two  points  are  expanded  upon  in  the  following  two  sections. 

1.  Each  PPT  is  Closely  1. inked  to  the  Grammar 

To  understand  how  the  local  parsing  algorithms  ope -ate  (especially  the  CONNECT  algorithm), 
one  must  appreciate  how  closely  the  PPTs  are  tied  to  the  transition  network  grammar.  The 
sequence  of  sons  of  each  node  represents  some  permissible  path  of  arcs  through  the  transition 
network  v.hieh  defines  that  nonterminal.  This  path  of  ares  is  contained  explicitly  in  each  node. 

In  fact,  each  node  includes  the  following  information: 

(a)  a  list  of  sons 

(b)  a  list  of  arcs  taken  (i.e.,  the  path  through  the  network! 

(e)  the  stan  state  (of  the  path) 

f d)  the  end  state  (of  the  path). 

Tnus,  given  a  PPT,  one  can  get  directly  to  the  beginning  and  end  states  the  path  takes  through 
the  transition  network  in  forming  each  nonterminal  node.  Therefore,  one  can  readily  investigate 
arcs  emanating  from  an  end  state,  for  instance,  to  see  hmv  the  path  might  be  extended.  This  is 
an  important  capability,  since  the  CONNECT  algorithm,  as  discussed  in  Sec.  V-l),  consists  basi¬ 
cally  of  exploring  ways  in  which  the  paths  of  two  ’’PTs  can  he  extended  and  joined. 

Since  the  PPT  is  so  closelv  tied  to  the  grammar,  we  often  talk  of  the  beginning  or  end  state 
of  a  node,  meaning  the  beginning  or  end  state  in  the  grammar  of  the  path  of  ares  taken  in  forming 
the  node. 

Z.  l.PARS'  PPTs  vs  Woods'  Structure  Huilding 

The  parse  structures  described  here  differ  from  those  constructed  by  Woods'  transition  net 
parser.  ^  In  Woods'  system,  syntactic  structure  is  built  by  user-written  BIT1..X)  functions. 
These  usor-huilt  structures  represent  a  kind  of  deep  structure  rather  than  the  path  taken  through 
the  grammar.  In  l.PARS,  no  attempt  is  made  to  construct  deep  structure  as  Woods'  system 
does.  The  only  parse  structures  built  are  created  dirertly  from  the  transition  net  as  illustrated 
in  Fig.  V-l 

3.  Ancestor  Links 

The  only  unusual  feature  of  the  PPTs  produced  by  TREE  is  the  use  of  "ancestor  links." 
Ancestor  links  are  used  to  express  the  local  ambiguity  caused  by  the  existence  of  right  recursion 
in  a  grammar. 

For  instance,  assume  that  noun  phrase  (NP)  and  prepositional  phrase  (PP)  are  defined  as 
shown  in  Fig.  V -Z. 

This  is  a  simple  example  of  right  recursion  since  a  NP  can  end  in  a  TP  which  can  end  in  a 
NP,  etc.  Thus,  if  one  sees  only  a  noun  phrase  to  the  left  <  f  a  verh,  for  instance,  one  does  not 
know  if  that  noun  phrase  is  the  suhject  of  the  verb,  or  if  it  is  arbitrarily  nested  in  PPs  modifying 
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Fig.V-2. 

the  subject.  Thus,  the  string  of  words  "the  box  supports"  could  be  part  of  .  v»h’ch  the  box 
supports.  .  ",  or  "the  block  in  the  box  supports.  .  ",  or  "the  cube  on  the  block  in  the  box  sup¬ 
ports.  . ",  etc. 

If  ancestor  links  were  not  used,  there  would  be  an  infinite  number  of  ways  to  connect  a  NP 
to  a  verb,  including  these  shown  in  Fig.  V-3. 
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Fig.  V-3. 


Using  ancestor  links,  however.  whenever  a  NP  is  found  to  the  left  of  a  verb,  the  two  struc¬ 
tures  shown  in  Fig.  V-4  can  be  produced. 
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Fig.  V-4. 

The  starred  link  in  PPT  2  is  an  ancestor  link,  and  embodies  an  unlimited  number  of  possible 
global  relationships.  The  ancestor  link  implies  that  the  NP  is  the  rightmost  descendant  of  the 
NP  arc  i'.i  the  SENT  net,  without  specifying  the  exact  relationship.  Notice  that  if  the  noun  phrase 
wap  not  a  semantically  valid  subject  of  the  verb,  'only  PPT  2  need  be  produced. 

If  only  a  noun  had  been  found  to  the  left  of  a  verb,  only  one  structure  need  be  produced  (see 
Fig.  V-5) . 

SENT 

/X  \\ 

noun  verb 


Fig.  V-5. 
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More  formally,  an  ancestor  link  represents  an  unspecified  chain  of  nonterminal  nodes  con¬ 
necting  a  PPT  to  a  higher-level  node.  (A  chain  of  NTs  is  one  or  more  NT  directly  descendant 
from  one  another,  each  having  onlv  one  son.)  The  presence  of  at  ancestor  link  means  that  a 
chain  of  nonterminal  nodes  has  been  deleted  and  replaced  by  the  ancestor  link.  As  illustrated 
in  Fig.  V-5,  this  allows  a  single  ancestor  link  to  represent  an  infinite  number  of  possible 
structures. 

An  ancestor  link  can  be  thought  of  as  saying  "1  don't  know  what  is  to  the  left  of  me.  Manv 
structures  are  possible."  Thus,  such  links  can  occur  only  on  the  leftmost  side  of  a  partial  parse 
tree.  (If  left  recursion  were  allowed  in  LPARS'  grammar,  then  ancestor  linko  could  also  appear 
on  the  rightmost  side  of  a  PPT.)  TREE  automatically  inserts  an  ancestor  link  whenever  a  chain 
of  nonterminal  nodes  occurs  in  the  leftmost  son  of  a  PPT  being  constructed. 

The  concept  of  an  ancestor  link  is  a  very  simple  one.  It  is  also  a  very  natural  one  in  a  sys¬ 
tem  such  as  LPARS.  Such  links  allow  the  system  to  recognize  a  valid,  yet  ambiguous,  syntactic 
relationship  without  being  forced  to  specify  what  the  relationship  is.  Later,  when  an  attempt  is 
made  to  connect  the  PPT  to  other  PPTs,  the  relationship  can  be  resolved. 

More-complex  PPTs  containing  several  ancestor  links  ca  i  also  be  made.  For  example, 
given  the  string  of  words  "the  sofa  supports  contains  the  ashtray  endch",  one  of  the  possible 
PPTs  produced  is  shown  in  Fig.  V-6. 
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as  in:  "The  box  which  the  table  beside  the  sofa 
supports  contains  the  ashtray") 

Fig.  V-6. 

Since  I.PARSonly  allows  right  recursion,  these  ancestor  links  can  occur  only  on  the  leftmost 
side  of  a  PPT.  Were  left  recursion  allowed,  such  links  would  also  be  possible  on  the  rightmost 
side  of  a  PPT. 

4.  Partial  Parse  Tree  Terms 

We  now  define  a  rew  terms  useful  in  talking  conciseiv  about  partial  parse  trees. 

A  "Final  Node":  A  nonterminal  node  of  a  PPT  is  a  final  node  (a)  if  the  path  of  arcs 
taken  ends  on  a  permissible  final  state  and  (b)  if  its  rightmost  son  is  either  ter¬ 
minal  or  is  itself  a  final  node. 

An  "Initial  Node":  A  nonterminal  node  of  a  PPT  is  an  initial  node  (a)  if  the  path  of 
arcs  taken  begins  on  a  permissible  initial  state,  and  (b)  if  its  leftmost  son  is  either 
terminal  or  is  itse.f  an  initial  node. 
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To  11  Invoke”:  An  arc  in  the  grammar  "invokes"  a  word  if  it  is  labeled  bv  the  syn¬ 
tactic  class  of  the  word.  It  invokes  a  PPT  if  it  is  labeled  by  the  syntactic  class  of 
the  top  node  of  the  PP  T. 

A  "Context  Arc":  An  arc  of  the  grammar  is  a  possihle  context  a  for  a  PPT  if  it 
invokes  the  PPT. 


C.  TREE:  THE  PARTIAL  PARSE  TREE  CONSTRUCTOR 

This  section  describes  TREE,  the  algorith..i  which  constructs  partial  parse  trees  in  various 
parts  of  the  input  sentence.  In  spirit,  the  algorithm  is  quite  Similar  to  more  traditional  left-to- 
right  parsing  algorithms.  A  nuniher  of  additional  considerations  are  made  necessary,  however, 
by  the  local  nature  of  the  I  PARS  noproach. 

The  TREE  algorithm  itself  does  not  attempt  to  recognize  or  hypothesize  any  words  in  the 
input.  Rather,  it  accepts  word  candidates  which  have  been  recognized  by  previous  activity  of 
LPARS,  and  constructs  as  many  different  local  parse  structures  as  possihle  using  these  words. 

If  all  the  words  of  the  inout  sentence  have  been  found,  then  a  parsing  of  the  entire  sentence  is 
among  the  parse  structures  produced.  If  not,  TREE  will  output  a  collection  of  partial  parse 
trees  (PPTs)  which  are  then  turned  over  to  a  second  algorithm,  CONNECT.  CONNECT  attempts 
to  connect  pairs  of  parse  trees  together  by  using  the  grammar  to  propose  words  that  might  exist 
between  them.  It  then  tests  these  words  out  at  high  phonetic  distance  against  the  input  (see 
Fig.  V-7). 
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Fig.  V-7. 

We  first  present  an  overview  of  the  operation  of  the  TREE  algorithm.  Then  we  discuss  two 
problems  which  do  not  arise  in  left-to-right  parsing  systems,  but  which  must  be  dealt  with  to 
implement  the  local  approach  of  LPARS. 


1.  Goal  of  TREE 

The  goal  of  the  TREE  algorithm  is  to  construct  as  many  different  local  syntactic  structures 
as  possihle  using  the  input  words.  Let  us  see  what  this  means  in  practice.  Suppose  the  word 
string  "the  big  man"  was  recognized.  Clearly,  these  words  could  all  be  incorporated  into  the 
PPT  shown  in  Fig.  V-8. 


NP 

s?\ 

the  big  man 


Fig.  V-8. 

This,  in  fact,  is  the  only  PPT  which  LPARS  creates.  Notice,  however,  that  there  are  five 
other  "subsumed"  structures  which  can  be  formed  (Fig.  V-9). 

A  PPT  (PPT  1)  is  "subsumed"  by  another  PPT  (PPT  2)  if  PPT  1  incorporates  only  a  subset 
of  the  words  incorporated  by  PPT  2,  and  if  the  syntactic  structure  connecting  those  w'ords  is 
identical  in  the  two  PPTs. 
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Fig.  V-9. 


It  might  seem  reasonable  for  TREE  to  create  all  possible  subsumed  structures,  on  the 
theory,  in  the  example  above  for  instance,  that  "the"  and  "big"  might  be  correct  words  but  "man" 
might  have  been  in  error.  The  approach  of  TREE,  however,  is  not  to  produce  any  PPT  which 
is  completely  subsumed  by  some  other  PPT  which  t  constructs.  If  any  word  is  later  thought  to 
be  in  error,  EPARS  can  break  the  structure  down  into  smaller  parts. 

Thus,  the  goal  of  TREE  is  to  construct  as  many  local  PPTs  as  possible,  with  the  provision 
that  no  PPT  produced  is  subsumed  by  another.  For  instance,  given  "the  book  beside  the",  TREE 
would  produce  the  four  structures  shown  in  Fig.  V-10. 
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Fig.  V-10. 


These  four  valid  global  interpretations  of  the  words  are  all  structurally  different.  Notice  that 
PPT  2  does  not  include  the  word  "the",  since  only  a  relative  pronoun  (such  as  "which")  can  follow' 
the  preposition  in  the  SENT  transition  net.  Notice  also,  that  PPT  4  really  represents  a  potentially 
unlimited  number  of  global  relationships  because  of  the  ancestor  link. 

As  discussed  above,  the  goal  of  TREE  is  to  construct  all  possible  PPTs.  Another  way  of 
phrasing  this  goal  is  to  say  that  TREE  hypothesizes  all  possible  global  relationships  that  might 
exist  in  the  grammar  between  all  sets  of  adjacent  words. 

It  is  reasonable  to  ask  why  a  system  should  go  through  this  intermediate  step  of  constructing 
all  possible  PPTs.  The  reason  is  that  once  these  PPTs  are  built,  LPARS  can  examine  them  in 
toto,  and  decide  vrhrt  ar°as  of  the  input  are  best  candidates  for  further  investigation  at  high 
phonetic  distance 
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2  Implementation  Overview 


The  following  is  a  description  of  the  TREE  algorithm  in  general  terms,  to  give  an  overview 
of  the  issues  involved.  We  hope  that  this  overview  will  suffice  for  most  readers. 

TREE  accepts  as  inpu  a  list  of  words,  possibly  overlapping,  ordered  by  their  position  in 
the  sentence.  It  processes  these  one  at  a  time,  thus  proceeding  from  left-to-right  through  the 
sentence.  Since  at  each  point  TREE  builds  all  possible  parses,  no  backup  is  necessarv. 

TREE  stores  the  PPTs  it  constructs  in  a  "phoneme  position  table"  (Fig.  V-lt)  which  has  one 
cell  for  each  phoneme  position  in  the  input  sentence.  Each  cell  has  a  list  of  all  PPTs  constructed 
which  end  .t  that  phoneme  position  in  the  sentence. 


Fig.  V-li.  Phoneme  position  table. 


Let  us  take  the  following  simple  example  to  see  how  the  table  is  u.  ,.  Suppose  the  following 
word  candidates  were  found:  (the  li  12)  (big  13  15)  (man  16  18).  The  numbers  represent 
the  beginning  and  ending  phoneme  positions  of  each  word  in  the  input.  TREE  takes  each  word 
in  turn  and  attempts  to  connect  it  to  even  l’PT  ending  immediately  before  that  word. 

Thus  (the  11  12)  is  taken  first.  Since  no  PPTs  end  at  position  10,  no  connections  can  be 

made,  a  the  PPT  "the"  is  recorded  at  position  12.  Next  (big  13  1  5)  is  taken;  it  is  connected 
to  "the",  and  a  PPT  "the  big"  is  recorded  at  position  15.  At  this  point,  the  PPT  "the"  is  flagged 
to  indicate  that  it  is  a  subsumed  PPT.  Similarly,  "man"  is  connected  to  "the  big",  yielding 
"the  big  man"  ending  at  position  18,  and  the  PPT  "the  big"  is  flagged  as  subsumed.  The  result 
of  this  activity  is  shown  in  Fig.  \  -12. 


Fig.  Y-l 2. 


Notice  that  once  a  P"  T  which  has  been  incorporated  into  a  larger  structure  is  marked  as 
subsumed,  even  though  it  will  not  be  part  of  the  final  output  of  TREK,  it  is  kept  in  the  table. 
Therefore,  if  an  overlapping  word  candidate  has  been  found,  such  as  (manufacturer  16  24),  it 
can  also  be  connected  to  "the  big"  yielding  the  PPT  "the  big  manufacturer"  ending  at  position  24. 

This  example  is  a  very  simple  one.  In  practice,  the  input  being  processed  may  contain 
several  words  in  any  section  of  input.  Words  may  overlap  one  another  at  random.  Alternatively, 
gaps  may  exist  (if  they  do,  of  course,  r.o  syntactic  connections  are  mad  '1  across  ese  gaps). 

Notice  also  that  connecting  a  word  to  a  single  previously  made  PPT  may  vield  several  PPTs. 
As  illustrated  in  Fig.  V-8,  for  instance,  adding  "beside"  to  "the  book"  yields  four  different  PPTs. 
Whenever  a  word  is  processed  by  TREE,  it  is  connected  ir  as  many  different  ways  as  possible 
to  al?  PPTs  which  end  directly  before  that  word.  In  addition,  as  discussed  below  in  Sec.  3-f, 
TREE  looks  to  sec  if  the  word  can  he  connected  to  a  rightmost  subpm  i  of  each  PPT  in  some  other 
syntactic  context. 

Before  adding  a  PPT  to  the  appropriate  phoneme  position  licit,  of  co  irse,  TREE  first  checks 
to  make  sure  it  is  not  subsumed  by  any  previously  created  PPT  already  on  the  list.  Similarly, 
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if  the  new  PPT  itself  subsumes  any  PPT  already  on  the  list,  the  previous  PPT  is  flagged.  These 
two  cheeks  are  necessary  foi  reasons  described  below  in  Sec.  3-f. 

3.  Making  Syntactic  Connections 

1 

There  are  three  different  ways  in  which  syntactic  connections  can  be  made  between  a  pre¬ 
viously  created  PPT  which  has  been  taken  from  the  phoneme  position  table,  and  a  new  word  being 
processed.  These  different  connection  modes  are:  (a)  appending,  (b)  pushing,  and  (c)  popping. 

Modes  (a)  and  (b)  are  quite  straightforward;  (c)  is  more  complex  due  to  ancestor  link  considerations. 

a.  Appending 

Appending  is  the  simplest  mode  of  connection  and  is  illustrated  by  an  example  (Fig.  -13)  of 
the  word  "man"  being  added  to  the  PPT  "the  big".  Here,  the  end  state  of  the  PPT  "the  b.g"  in 
the  grammar  has  an  exit  arc  labeled  "noun";  thus,  the  PPT  "the  big  man"  can  be  formed  directly 


Mere  formally,  an  appending  connection  is  made  to  a  rightmost  nonterminal  node  of  a  PPT. 

!t  requires  that  the  r'^ntmost  son  of  that  node  be  a  terminal  or  a  final  nonterminal.  It  also  re¬ 
quires  that  the  rightmost  state  of  that  node  have  ar  exit  arc  which  invokes  the  word  being  procec  ,ed. 

b.  Pushing 

Pushing  is  illustrated  by  a  similar  example:  the  woi  j  "beside"  being  added  to  the  PPT 
"the  book".  Here,  the  end  state  of  the  PPT  has  an  exit  labeled  PP,  and  the  PP  network  can  start 
with  a  preposition.  Thus,  the  PPT  "the  book  beside"  con  be  formed  (see  Fig.V-14).  (A  similar 
successful  push  connection  can  be  made  using  the  SENT  net.) 
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Fig.  \  -14. 


More  formally,  a  pushing  connection  can  be  made  to  a  rightmost  nonterminal  node  of  a  PPT. 
It  requires  that  the  rightmost  son  of  that  node  be  a  terminal  or  a  final  nonterminal.  It  also  re¬ 
quires  that  the  rightmost  state  of  that  node  iave  an  exit  arc  invoking  a  nonterminal  which  can 
start  (possibly  recursively'  with  an  arc  invoking  the  word  being  processed. 


Example:  In  connecting  the  noun  "man"  to  the  PPT  "the  big",  an  appending  connection  can  be 
made.  In  connecting  the  preposition  "beside"  to  the  resulting  PPT,  a  pushing  connection  must 
be  made  to  *he  PP  transition  net.  Figure  V-15  illustrates  these  two  connections. 
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c.  Popp  ng  and  Ancestor  Link  Formation 

Popping  is  attempted  if  the  PPT  is  a  terminal  or  if  it  is  a  final  nonterminal.  Popping  in¬ 
volves  making  a  svntaetic  connection  through  a  higher-level  net. 
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Fig.  V-16. 

The  first  step  in  making  a  "popping"  connection  is  finding  a  higher-level  context  for  the  PPT 
which  is  to  be  connected  (i.e.,  an  arc  which  "invokes"  that  PPT).  For  instance,  in  connecting 
a  NP  to  a  VER11,  theie  are  three  NP  context  arcs  in  the  .SEN  T  net  (see  Fig.  V-16).  Each  of  these 
context  arcs  is  taken  m  e  at  a  time,  and  an  appropriate  PPT  of  the  form  shown  in  Fig.  V-17  is 
constr’  c»ed. 


SENT 

NP 

Fig.  V-17. 

Next,  an  attempt  is  rrude  to  connect  this  new  PPT  to  the  verb.  In  only  one  (the  leftmost)  of 
the  three  context  arcs  depii  in  Fig.  V-16  will  this  attempt  succeed  since  this  is  the  only  NP 
arc  immediately  to  the  left  of  a  verb  arc. 

In  attempting  to  connect  tic  new  PP'I  io  the  verb,  the  algorithm  will  try  all  three  moues  of 
connection:  apoending,  pushing,  and  popping  (if  the  new  PPT  ends  on  a  final  state).  Thus,  the 
popping  process  is  recursive,  as  the  algorithm  tries  all  possible  combinations  of  multiple  popping 
that  might  yield  a  connection  (as  described  in  Sec.  ?-d  „elow). 

There  is  a  problem  involved  in  this  recursive  popping,  nanvdy,  the  algorithm  (as  stated  so 
far)  could  get  ..ito  an  infinite  loop.  For  instance,  since  a  noun  ,  trase  can  end  a  prepositional 
phrase  which  can,  in  turn,  end  a  noun  phrase,  the  algorithm  could  merrily  pop  up  endlessly  from 
one  to  another  it  can  be  seen  that  this  is  the  ancestor  link  problem  in  a  new  guise.  The  easiest 
solution  to  this  problem  is  just  to  keep  a  list  of  the  arcs  already  poy ’  ‘hrough  at  each  point  and 
abort  the  recursive  popping  if  an  arc  already  popped  through  is  approached  again. 

In  practice,  by  visually  inspecting  the  grammar,  one  can  insert  tests  that  abort  a  number 
of  pops  so  that  not  all  possibilities  are  attempted.  These  tests  are  put  in  to  improve  efficiency, 
but  do  not  alter  the  basic  nature  of  the  algorithm. 
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Mere  formally,  a  popping  connection  can  he  made  in  the  following  circ  j-.-tances:  (1)  If  the 
FPT  is  terminal,  or  if  the  PPT  is  a  final  nonterminal  node;  (2)  if  there  exists  a  context  arc 
(which  invokes  the  PPT)  which  has  not  been  popped  through  in  forming  the  PPT;  and  (3)  if  the 
new  Pi  formed  by  popping  to  this  arc  can  be  connected  (via  appending,  pushing,  or  popping) 
to  the  word  being  processed. 

d.  Examples  of  Popping 

The  algorithm  is  able  to  abort  the  recursive  popping  successfully  and  still  get  all  possible 
local  structures  because  it  is  set  up  to  construct  ancestor  link  structures  while  making  these 
popping  connections.  As  describ  d  in  Sec.  3  above,  an  ancestor  link  represents  an  unspecified 
chain  of  nonterminal  nodes.  Such  an  ancestor  link  is  created  when  the  algorithm  pops  through 
successive  nonterminal  nodes. 

A  number  of  examples  using  different  PPTs  can  best  serve  to  explain  how  popping  and  an¬ 
cestor  link  formation  take  place. 

Example  1 

NOUN 

I 

man 

When  pooping  with  a  terminal,  one  first  takes  each  a.c  invoking  that  terminal  and  attempts 
connections.  Thus,  the  PPT  shown  in  Fig.  V-18  would  be  constructed.  In  this  case,  a  second 
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mor 

Fig.  V-l  8. 

pop  can  be  made  recursively  since  the  new  PPT  ends  on  a  final  state.  Before  attempting  the 
recursive  pop,  however,  the  algorithm  first  examines  the  PPT  to  see  whether  it  should  be  trans¬ 
formed  into  an  ancestor  link  structure.  The  rule  is:  if  a  nonterminal  which  has  only  a  single 
son  is  to  be  popped,  then  the  nonterminal  is  replaced  by  an  ancestor  link  structure.  It  is  possible 
to  use  this  simple  rule  for  two  reasons: 

(1)  The  CONNECT  algorithm,  which  processes  these  PPTs,  is  set  up  to  expect  PPTs 
constructed  in  this  fash  n. 

(2)  In  LPARS'  grammar,  there  are  no  nonterminal  nodes  which  can  be  made  to  begin 
and  end  on  permitted  states  yet  still  have  only  a  single  son.  The  rule  would  have 
to  be  somewhat  modified  to  accommodate  sucli  nonterminals. 

In  the  above  example,  since  the  new  i  PT  to  be  popped  is  a  nonterminal  with  a  single  son, 
the  algorithm  automatically  inserts  an  ancestor  link  structure,  transforming  the  new  PPT  as 
seen  in  Fig.  V-lh. 
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Fig.  V-19. 
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Even  though  the  NP  node  has  been  deleted,  t.»e  algorithm  must  remember  that  it  is  dealing 
with  a  NP  and  therefore  use  NP  context  arcs  in  making  connections.  This  PPT,  for  instance, 
couid  be  connected  to  a  verb  as  shown  in  Fig.V-20. 
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Fig.  V-20. 

Since  a  NP  can  end  a  sentence,  a  third  recursive  pop  can  be  generated  of  the  PPT  (see 
Fig.  V-21). 


SENT 


*  / 

noun 


man 

Fig.  V-21. 

Again,  the  algorithm  inspects  the  PPT  before  allowing  the  recursive  popping  to  take  place, 
and,  in  this  case,  propagates  the  ancestor  link  deleting  the  SENT  node  and  obtaining  the  PPT 
shown  in  Fig.  V-22. 
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Fig.  V-22. 

Thus,  the  same  ancestor  link  structure  is  passed  up  through  successive  levels  of  popping. 
Again,  although  now  both  the  NP  and  SENT  nodes  have  been  deleted,  the  algorithm  must  remember 
that  it  is  dealing  with  a  '’ENT.  This  "propagation"  of  the  ancestor  link  is  done  because  any  inter¬ 
vening  nodes  are  superfluous  until  the  structure  is  finally  connected  to  the  word  being  processed, 
.’or  instance,  the  PPT  could  be  connected  to  an  "endch"  as  shown  in  Fig.  V-23. 
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Fig.  V-23. 

Example  2  -  a  Partial  NP 


adj  noun 

T  PPT  is  handled  in  a  fashion  very  similar  to  Example  1.  Here,  however,  the  ances'or 
link  structure  which  is  passed  up  is  shown  iii  Fig.  V-24. 
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Fig.  V-24. 


Example  3 


SENT^ 

^NP^  verb  (as  in  "which  the  table  supports") 

det  noun 

This  partial  SENT  PPT  is  handled  in  exactly  the  same  fashion  as  Example  2.  The  ancestor 
link  structure  used  in  the  popping  is  shown  in  Fig.  V-25. 
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Fig.  V-25. 

e.  Tests  to  Abort  Popping  in  Certain  Circumstances 

The  grammar  used  by  EPARS  is  restricted;  therefore,  the  efficiency  problems  involved  in 
multiple  popping  are  not  prohibitive.  With  more  complex  grammars,  this  problem  could  become 
more  significant. 

One  potentially  useful  technique  for  improving  efficiency  is  to  inspect  the  grammar  and  insert 
logic  to  abort  various  possible  pops  before  they  are  attempted.  This  could  be  useful  in  two  gen¬ 
eral  situations. 

(1)  When  the  word  being  connected  does  not  follow  a  particular  PPT. 

(2)  When  one  is  popping  with  an  ancestor  link  stiucture,  and  there  are  several  ways 
that  the  PPT  can  pop  to  a  higher-level  context.  For  example,  there  is  more  than 
one  arc  which  allows  a  NP  to  end  a  SENT.  Thus,  the  algorithm  as  stated  can  con¬ 
nect  a  NP  to  an  ENDCH  once  for  each  of  these  contexts,  yet  the  resulting  structures 
would  be  identical  since  the  intervening  nodes  popped  through  would  be  deleted.  One 
obviously  inefficient  solution  to  this  problem  is  to  let  several  identical  parse  trees 
be  formed,  but  let  l, PARS  ignore  the  duplicates.  Another  aroroach  is  to  inspect 
the  grammar  and  insert  logic  to  abort  popping  so  that  only  one  structure  is  ever 
built  in  such  situations. 

In  any  case,  this  issue  (although  important  for  practical  reasons  of  efficiency)  is  not  central 
to  the  basic  algorithm.  It  is  perfectly  possible  to  let  TREE  explore  all  possible  popping  paths 
every  time,  using  only  the  generai  mechanism  that  aborts  if  the  popping  starts  to  repeat  itself. 

In  effect,  this  amounts  to  letting  TREE  use  the  grammar  "interpretively."  By  inspecting  the 
grammar,  one  can  manually  "vompile"  a  set  of  tests  that  let  TREE  operate  more  efficiently. 

In  fact,  it  is  probably  possible  to  construct  a  set  of  rules  that  allows  EPARS  to  inspect  the  gram¬ 
mar  and  construct  the  tests  automatically.  This  possibility  is  not  pursued  in  the  present  work 
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f.  Two  Problems 


There  are  two  unique  problems  to  be  faced  by  a  local  parsing  strategy,  problems  which 
do  not  arise  in  a  totally  left-to-right  parsing  approach.  These  problems  arise  in  attempting  to 
pursue  the  goal  of  finding  all  possible  "maximal"  local  structures  (maximal  in  the  sense  that 
none  are  subsumed  by  others). 


Problem  1 

The  first  problem  might  be  called  that  of  "finding  all  contexts  for  possible  syntactic  connec¬ 
tions  to  rightmost  subparts"  and  can  be  illustrated  by  considering  the  following  hypothetical  input: 
"the  man  places  the  book  supports".  Among  the  PPTS  which  can  be  created  from  this  string  are 
the  two  shown  in  Fig.  V-26. 
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Fig.  V-26. 


The  problem  is  illustrated  as  follows.  Since  TREE  creates  PPTs  by  examining  the  words 
left-to-right,  at  some  point  it  has  created  the  PPT  shown  in  Fig.  V-27,  and  is  attempting  to  add 
to  this  structure  the  verb  "supports"  if  possible.  As  shown  in  Fig.  V-26,  not  only  can  "supports" 
be  connected  to  the  entire  structure  but  it  also  can  be  connected  to  the  noun  phrase  "the  book" 
in  a  different  -yntactic  context. 
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Fig.  V-27. 

Thus  when  examining  a  new  word,  THEE  must  do  more  than  just  continue  any  parsing  that 
it  has  already  developed.  It  must  also  see  if  any  rightmost  subparts  of  structures  th3t  it  has 
built  up  can  be  connected  to  the  new  word.  There  are  two  general  cases  of  this  problem: 

(1)  In  the  example  above  (Fig.  V-26)  where  a  rightmost  son  of  the  PPT  can  be  connected 
to  the  word  in  a  different  syntactic  context. 

(2)  The  other  example  is  where  one  can  connect  the  word  to  the  PPT  but  only  if  some 

of  the  leftmost  sons  of  the  PPT  are  deleted.  For  instance,  consider  the  word  string 
"which  the  table  supports  the  ashtray".  The  string  "which  the  table  supports"  can 
be  formed  into  a  sentence  fragmei  t.  Yet  "the  ashtray"  cannot  be  joined  to  this 


entire  fragment.  The  word  "which"  must  be  removed  first,  after  which  the  structure 
"the  table  supports  the  ashtray"  can  be  formed. 

This  problem  is  not  a  difficult  one  to  solve  once  it  has  been  understood,  but  it  does  require 
a  somewhat  different  parsing  strategy  than  does  a  purely  top-down  left-to-right  system. 

Problem  2 

The  second  problem  arises  directly  from  the  solution  of  the  first.  Namely,  in  solving  Prob¬ 
lem  1,  TREE  may  at  times  construct  a  parse  structure  using  a  right  nost  subpart  of  one  parse 
structure,  but  this  parse  structure  maybe  subsumed  by  some  other  parse  structure  which  TREE 
has  created.  This  second  problem,  the  elimination  of  redundant  pr"''s,  can  be  illustrated  by  the 
string  of  words  "the  book  beside  the".  The  string  "the  book  beside"  generates,  among  others, 
the  two  PPTs  shown  in  Fig.V-28. 
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Fig.  V-28. 

When  "the"  is  added  to  PPT  1,  it  yields  PPT  3  (see  Fig.  V-2<>1.  However,  any  attempt  to 
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Fig.  V-29. 

add  "the"  to  PPT  2  will  clearly  fail.  On  the  other  hand,  "the"  can  be  connected  to  a  rightrn^ 
subpart  of  PPT  2  ("beside")  in  another  syntactic  context,  yielding  PPT  4  (see  Fig.  V-  so) .  But, 
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Fig.  V-30. 

PPT  4  is  subsumed  by  PPT  3  and  'herefore  is  not  wanted.  In  practice,  LPARS  creates  each  PPT 
tentatively  and  throws  it  away  if  it.  is  subsumed  by  a  previously  constructed  PPT,  or  by  a  PPT 
constru:  .ed  later. 
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The  two  problems  discussed  above  arise  from  the  local  nature  of  LPARS'  parsing.  Prob¬ 
lem  1  arises  from  the  need  to  form  all  possible  syntactic  connections.  In  the  process  of  finding 
these,  however,  the  algorithm  may  also  find  some  redundant  structures.  This  gives  rise  to 
Problem  2,  the  need  to  throw  such  redundant  structures  away.  Possibly,  a  more  elegant  solution 
allows  the  construction  of  all  valid  parse  trees  without  the  redundancy  inherent  in  the  present 
algorithm 

4.  Some  Remarks  on  TREE 

This  section  makes  two  remarks  about  the  nature  of  the  TREE  algorithm  and  its  relationship 
to  other  parsing  methods. 

a.  Efficiency  Considerations  for  the  Local  Approach 

The  TREE  algorithm  may  seem  to  be  inefficient.  It  certainly  can  construct  a  iair  number 
of  different  parse  structures  out  of  the  same  woros.  It  must  be  remembered,  however,  that  all 
parsers  do  this  to  some  degree  using  backup,  the  only  difference  being  that,  when  backup  is  used, 
only  one  partial  structure  exists  at  a  time. 

Some  left-to-right  parsers,  such  as  Earley's,^  do  construct  multiple  parse  trees.  In 
theory,  such  an  algorithm  is  just  as  efficient  as  one  using  backup.  Of  course,  TREE  would 
probably  construct  more  partial  parse  structures  than  Earley’s  parser,  given  the  same  grammar 
and  the  same  input,  be  ause  Earley's  algorithm  expects  to  be  given  all  the  tokens  in  the  sentence 
and  thus  reed  only  construct  PPTs  which  make  sense  in  the  possible  left-to-right  contexts  present. 

However,  the  basic  point  is  that,  although  the  number  of  PPTs  constructed  by  TREE  may  at 
first  appear  large,  this  is  partly  because  TREE  does  not  use  backup  to  effectively  erase  part  of 
its  work,  but  rather  sets  it  all  up  for  everyone  to  see.  In  fact,  by  doing  all  the  work  once  and 
for  all  before  calling  the  CONNECT  routine,  it  helps  make  I  PARS  a  more  efficient  system  than 
if  it  had  to  redo  this  parsing  many  times  as  CONNECT  tried  out  different  possible  ways  of  hypoth¬ 
esizing  unfound  words  between  structures  in  different  parts  of  the  sentence. 

b.  I  nique  Features  of  the  TREE  Algorithm 

The  local  nature  of  the  TREE  algorithm  results  in  a  number  of  unique  problems  which  do  not 
occur  in  other  parsers.  It  is  useful  to  outline  these  specifically: 

(1)  The  ancestor  link  problem  is  unique  to  an  approach  which  attempts  to  create  local 
structures. 

(2)  The  necessity  of  finding  all  possible  syntactic  connections  between  structures  being 
built  up  is  a  uniqje  part  of  the  local  approach.  In  the  particular  left-to-right  ap¬ 
proach  10  building  local  structures  used  ny  THEE,  this  necessity  gave  rise  to  the 
two  interesting  problems  discussed  in  Sec.C-3-f  above. 

(3)  The  need  to  have  a  syntactic  formalism  which  lets  parsing  originate  anywhere  in 
the  grammar  and  proceed  in  an  organized  fashion  is  unique  to  the  local  approach. 

D.  THE  CONNECT  ALGORITHM 

The  task  of  the  CONNECT  algorithm  is  to  take  two  PPTs  constructed  by  the  TREE  algorithm 
and  use  the  grammar  to  propose  possible  word-strings  which  might  exist  between  the  two  PPTs 
(see  F  ig.  \  -31 ) . 


51 


■’’■'■THHi.iiMiim 


LPPT  RPPT 


Fig.  V-31. 


We  often  refer  to  the  two  PPTs  as  the  left-hand  PPT  (I.PPT)  and  the  right-hand  PPT 
(HPPT).  These  PPTs  bound  an  input  segment  which  is  to  be  investigated.  CONNECT  proposes 
word-strings,  consisting  of  at  most  one  content  word  plus  a  small  number  of  function  words, 
which  might  fill  the  input  segment.  Then,  CONNECT  tests  these  hypotheses  against  the  input 
segment  at  a  high  phonetic  distance.  If  any  match  succeeds,  the  combined  PPT  is  added  to  the 
collection  of  PPTs  being  investigated.  When  processing  two  input  PPTs  in  this  fashion.  CONNECT 

tests  out  all  possible  hypotheses  which  are  syntactically  and  semantically  permissible. 

« 

1 .  Why  Only  One  Content  Word? 

It  is  reasonable  to  ask  why  CONNECT  searches  only  for  word-strings  containing,  at  most, 
one  content  word  plus  a  small  number  of  function  words.  Clearly,  one  might  have  allowed 
CONNECT  to  propose  a  longer  string  of  words.  Some  limit  must  be  set,  however,  and  the  design 
philosophy  of  I. PARS  is  to  keep  this  limit  fairly  low.  At  the  opposite  extreme  to  this  philosophy, 
one  approach  to  sentence  recognition  might  be  to  generate  all  possible  sentences  and  match  each 
against  the  entire  input.  This  approach  is  clearly  inefficient.  The  while  structure  of  LPARS 
is  designed  to  avoid  such  an  approach.  Thus,  the  idea  of  proposing  a  large  number  of  long  word- 
strings  without  looking  at  the  actual  phonemic  input  is  avoided  as  much  as  possible  by  LPARS. 

If  there  is  a  large  section  of  input  between  two  PPTs,  LPARS  can  make  a  higher  distance  scan 
through  that  section  and  use  the  words  found  as  input  to  the  TREE  algorithm,  or  use  other  fa)  - 
back  mechanisms  as  is  discussed  in  Sec.  VI. 

It  would  be  simple  to  expand  the  "single  content  word"  constraint  described  above  to  make 
it  more  flexible.  One  might  also  allow  word-strings  consisting  of  two  content  words  and,  at 
most,  one  function  word.  Alternately,  if  the  input  section  between  is  fairly  small,  one  might 
want  to  look  for,  at  most,  one  or  two  function  words.  The  choice  made  in  LPARS  is  somewhat 
arbitrary,  and  is  certainly  not  the  only  possibility. 


2.  An  Introductory  Example 


As  discussed  above,  the  task  of  the  CONNECT  algorithm  is  to  find  all  word-strings  that 
might  exist  between  two  PPTs,  with  the  constraint  that  no  word -string  contain  more  than  one 
content  word.  This  section  discusses  a  lew  examples  to  illustrate  wha*  is  involved  in  this  process. 
Consider  the  two  °PTs  shown  in  Fig.  V-32. 
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Fig.  V-32. 
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Sample  word -strings  wt.ich  might  connect  these  PPTs  are  "supports  the",  or  "is  contained 
by  the". 

Consider  the  next  two  PPTs  (Fig.  V-33). 
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Fig.  V-33. 

Sample  word-strings  which  might  connect  these  PPTs  include:  "beside  the",  "in  the  big", 
"which  the",  "in  which  the",  "supports  in  the"  (as  in  "put  the  pipe  which  the  dictionary  supports 
in  the  red  ashtray").  In  fact,  there  are  a  great  many  possible  word -strings  which  could  connect 
these  two  PPT3.  (This  fact  raises  the  efficiency  question  of  whether  one  might  want  to  us  the 
CONNECT  algorithm  selectively.  The  answer  to  this  question  is  "yes,"  and  is  discussed  more 
fully  in  Sec.  VI.) 

Now  let  us  try  to  get  a  feeling  for  the  logic  involved  in  generating  these  possible  word-strings. 


3.  Extending  Paths  Through  the  Net 

This  section  makes  some  preliminary  comments  to  help  make  the  CONNECT  algorithm's 
activity  easier  to  visualize. 

Fundamental  to  this  activity  is  the  fact  that  every  PPT  is  very  closely  tied  to  the  transition 
network  grammar  (as  discussed  above  in  Sec.B).  The  sequence  of  sons  of  each  node  represents 
a  permitted  path  through  the  associated  network,  and  each  PPT  node  has  pointers  to  the  states 
in  the  net  on  which  the  path  begins  and  ends.  Therefore,  it  is  very  easy  to  explore  ways  in  which 
the  path  might  be  extended  (by  examining  arcs  emanating  from  the  end  state). 

Let  us  first  take  a  simple  example.  Suppose  we  are  given  the  partial  noun  phrase  "the  big" 
and  asked  to  use  the  grammar  to  see  if  we  can  make  this  PPT  end  on  a  final  state  by  hypothe¬ 
sizing  at  most  one  content  word  (see  Fig.  V-34). 
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Fig.  V-34. 


To  answer  this  question,  we  must  look  at  the  end  state  of  the  PPT  in  the  net  (state  NP2)  and 
follow  the  arcs  that  emanate  from  it.  We  then  see  that  the  PPT  can  be  made  to  end  on  a  final 
state  by  hypothesizing  a  noun.  The  resulting  "hypothesized  PPT,"  including  the  proposed  word 
is  shown  in  Fig.  V-35.  In  this  case,  nr  other  structures  are  possible. 
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Fig.  V-35. 
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Now  let  us  look  at  a  more  complex  example  of  which  the  previous  example  is  a  sub-pait. 
Suppose  that  the  sentence  (SENT)  net  is  as  shown  in  Fig.  V-36. 
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Fig.  V-36. 

Further  suppose  that  the  two  PPTs  of  Fig.  V-37  have  been  found 
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Fig.  V-37. 


The  question  now  is:  Can  these  two  PPTs  be  joined  by  proposing  at  most  one  content  word? 
For  an  answer,  we  first  look  at  the  SENT  node  of  each  PPT  and  observe  that:  (1)  the  SENT  node 
of  PPT  i  ends  on  state  2,  and  (2)  the  SENT  node  of  PPT  2  begins  on  state  2.  Thus,  these  two 
paths  can  be  directly  concatenated.  But,  for  the  resulting  structure  to  be  a  valid  PPT,  all  inner¬ 
most  nonterminal  sons  must  begin  and  end  on  permitted  states. 

Hence,  the  two  SENT  nodes  can  be  connected  only  if  the  noun  phrase  "the  big"  can  be  made 
to  end  on  "  final  state  by  proposing,  at  most,  or"  content  word.  As  we  saw  above,  this  can  be 
done  by  proposing  a  noun.  Therefore,  the  two  PPTs  can  be  joined  into  the  "hypothesized  PPT" 
shown  in  Fig.  V-38. 
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Fig.  V-38. 


This  simple  example  illustrates  quite  clearly  the  heart  of  the  CONNECT  algorithm,  namely: 

(1)  On  3ome  level,  paths  from  both  PPTs  must  be  joined. 

(2)  All  inner  nodes  below  this  joining  must  be  made  to  begin  or  end  (as  appropriate) 
on  permitted  states. 

These  two  basic  principles  are  the  key  to  the  operation  of  the  CONNECT  algorithm.  There 
are,  unfortunately,  a  number  ot  complications  which  make  the  algorithm  slightly  more  complex 
than  the  above  example  might  seem  to  indicate.  Nevertheless,  the  basic  principles  remain  those 
stated  above. 
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The  two  main  complications  are: 


(1)  Having  to  pop  to  a  higher-level  context  to  make  a  connection:  For  instance,  given 
two  noun  phrases,  one  possible  word-string  connection  is  a  verb.  To  find  this 
possibility,  the  algorithm  must  pop  to  the  SENT  net  (see  Fig.  V-39). 


Fig.  V-39. 

(2)  Ancestor  link  considerations:  The  necessity  to  accommodate  ancestor  links  poses 
further  considerations.  For  instance,  when  connecting  the  two  PPTs  shown  in 
Fig.  V-40. 
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Fig.  V-40. 

First,  the  SENT1  nodes  are  joined,  and  then  the  SENT  node  in  PPT  1  is  extended  to  join 
the  ancestor  link  son  of  PPT  2,  generating  the  combined  PPT  (see  Fig.  V-41). 
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Fig.  V-4i . 

Notice  that,  in  this  example  (due  to  the  ancestor  link),  paths  in  the  two  PPTs  were  joined 
at  two  levels  of  structure.  In  a  more  complex  example,  paths  can  be  joined  at  several  levels. 
(In  fact,  one  can  think  of  the  CONNECT  algorithm  as  being  able  to  "zip"  two  partial  structures 
together.) 


4.  Two  Parts  of  CONNECT 

The  preceding  section  discussed  the  basic  principles  involved  in  the  operation  of  the  CONNECT 
algorithm.  We  now  look  at  this  algorithm  in  more  detail.  There  are  two  distinct  parts  of  the 
CONNECT  algorithm: 

(a)  First  is  the  JOIN  routine  which  joins  the  higher-level  structure  of  the  PPTs, 

"zipping"  them  together  from  tor  to  bottom,  thereby  constructing  a  hypothetical 
com  lined  tree.  This  combin'd  tree  indicates  a  possible  word- string  that  might 
exist  hetween  the  input  PPTs.  The  JOIN  routine  by  itself  determines  only  tne  parts 
of  speech  of  the  possible  word-strings. 


(b)  Second  is  the  CONSTRAIN  routine  which  inspects  a  tree,  created  by  JOIN,  with  its 
hypothesized  words,  and  uses  the  contextual  syntactic  and  semantic  information  to 
compile  additional  constraints  beyond  mere  parts  of  speech,  further  narrowing 
down  what  the  hypothesized  string  of  words  might  be.  (For  instance,  if  the  unclear 
section  might  contain  a  verb,  inspection  of  the  context  might  allow  one  to  rule  out 
s  number  of  verbs  in  the  vocabulary.) 

The  following  sections  give  a  simple  example  of  these  two  routines  in  operation,  and  then 
describe  each  in  turn.  The  description  of  the  algorithm  ignores  lambda  transitions.  This  sim¬ 
plification  allows  the  description  to  be  somewhat  cleaner,  and  does  not  lose  any  of  the  overall 
flavor  of  the  algorithms  themselves. 

a.  Simple  Example 

We  now  give  a  simple  example  to  help  clarify  the  individual  functions  of  the  two  parts  of 
CONNECT,  and  s'  ow  how  they  fit  together  before  describing  them  in  detail.  Let  us  take  the 
example  given  in  the  introduction:  two  noun  phrases  at  the  beginning  and  end  of  the  sentence, 
with  a  gap  between  them  (Fig.  V-42). 
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Fig.  V-42. 


The  task  of  CONNECT  is  to  fino  all  the  ways  in  which  the  two  PPTs  can  be  joined  together 
by  words  which  *  aght  exist  between  them.  The  algorithm  starts  at  the  top  of  the  two  PPTs  and 
recognizes  that  each  is  headed  by  a  bENTl  node,  and  that  these  two  nodes  car.  be  joined.  The 
algorithm  therefore  constructs  the  structure  presented  in  Fig.  V-43. 


SENT^ 

strch  ?  endch 


Fig,  V-43. 

The  algorithm  places  a  pointer  to  this  combined  structure  in  a  global  location.  The  son 
labeled  "?"  represents  a  cell  in  a  list  of  sons  which  is  left  temporarily  empty.  It  will  later  be 
filled  in  as  processing  proceeds.  The  routine  now  calls  itself  recursively  with  the  two  PPTs 
of  Fig.  V-44  as  arguments.  The  address  of  the  empty  son  of  the  higher-level  structure  is  also 
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NP 

A 

Fig.,V  -44, 
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passed  down.  These  two  PPTs  are  the  inner  sons  of  the  original  two  FPTs.  Along  with  these 
two  structures,  the  algorithm  is  passed  additional  information  indicating; 


( 1 )  That  it  is  no  longer  processing  the  highest-level  nodes  of  the  trees  and  thus,  for 
instance,  no  popping  connections  can  be  attempted. 

(2)  That  both  PPTs  are  bounded  by  outside  context  (the  start  and  end  characters)  and, 
therefore,  that  when  processing  is  finished  all  inner  nonterminals  must  begin  or 
end  (as  appropriate)  on  permitted  states. 

The  algorithm  proceeds  to  test  for  possible  intervening  words  by  extending  the  path  of  the 
SENT  node  of  the  LPPT  starting  at  the  end  state.  In  so  doing,  it  tries  to  complete  the  SENT 
node,  incorporating  the  RPPT  (the  NP)  into  the  structure.  It  finds  that  it  can  successfully  do 
this  by  following  a  verb  arc  and  an  affix  arc  to  the  direct  object  NP  arc.  It  therefore  constructs 
the  PPT  shown  in  Fig.  V-45. 
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Fig.  V-45. 

This  PPT  is  then  attached  to  the  previously  built  structure  by  placing  a  punter  to  this  PPT 
in  the  cell  marked  "?"  in  the  higher-level  structure  above.  Thus,  the  PPT  of  Fig.  V-46  is  built 
and  is  accessible  through  a  global  pointer. 


SENT1 


Fig.  V-46. 


Now  the  first  part  of  the  CONNECT  process,  the  joining  of  higher-level  structure,  is  com¬ 
pleted.  In  more  complex  examples,  this  process  can  take  longer  and  involve  several  levels  of 
recursion  as  several  levels  of  structure  are  "zipped"  together. 

The  PPT  is  now  passed  on  to  the  second  part  of  the  CONNECT  algorithm,  CONSTRAIN, 
which  examines  the  entire  structure  to  find  additional  constraints  which  ould  be  placed  on  the 
verb,  such  as  "transitive,"  "must  be  able  to  take  a  certain  class  of  noun  as  subject  and  as  object," 
etc.  It  then  tests  all  verbs  satisfying  these  constraints  against  the  input. 

After  the  processing  of  this  first  possibility  is  complete,  CONNECT  resumes  trying  to  find 
different  ways  of  joining  the  two  structures.  Thus,  it  would  also  find  the  passive  sentence  form, 
proposing  a  word-string  of  the  form  "BE  VERB  AFFIX  BY"  as  in  "is  rupported  by". 

Also,  had  the  SENT  node  of  the  LPPT  ended  on  a  permitted  state,  the  algorithm  could  have 
dropped  down  recursively  and  attempted  to  connect  the  two  structures  given  in  Fig.  V-47, 

NP  NP 

A  A 

Fig.  V-47. 
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Tnese  could  be  connected  by  a  number  ^.i  word-strings  including:  PREP,  PREP  WH,  or 
WH.  In  fact,  if  the  right-hand  NP  had  not  originally  been  connected  to  the  ENDCH,  the  algorithm 
would  indeed  have  found  these  possibilities.  The  necessity  that  all  innermost  nodes  begin  and  end 
on  permitted  states,  however,  made  it  impossible  for  the  algorithm  to  drop  down  below  the  SENT 
node  in  trying  to  connect  tuo  NP. 

The  following  sections  describe  the  two  parts  of  this  algorithm  in  more  detail. 

b.  JOIN  Routine 

This  section  describes  the  logic  of  th">  JOIN  routine  which  takes  two  PPTs  and  creates  all 
possible  hypothetical  combined  PPTs  which  can  be  made  connecting  them. 

The  JOIN  routine  does  all  its  processing  of  the  transition  network  grammar  in  a  1  eft-to-right 
fashion,  starting  from  the  end  states  of  the  left-hand  PPT  (LPPT)  and  exploring  v:~  ,  in  which 
paths  can  be  extended  from  these  states  to  join  the  RPPT. 

To  illustrate  how  this  processing  is  performed,  we  first  break  the  problem  down  into  several 
components  and  describe  each  first. 

(1)  How  an  existing  path  can  be  extended  in  a  single  net. 

(Z)  How  a  hypothesized  path  can  be  initiated  in  a  net. 

(3)  How  a  path  can  be  continued  in  a  lower  net. 

(4)  How  a  path  extended  from  the  LPPT  can  be  joined  to  the  RPPT. 

Finally,  we  discuss  how  the  entire  algorithm  is  organized  to  explore  all  possible  connections. 

(1)  Extending  a  Path 

A  PPT  node  has  a  sequence  of  sons  which  represents  a  path  through  the  associated  transition 
network.  This  path  can  be  extended  by  looking  at  each  exit  arc  emanating  from  its  end  state. 

(a)  If  the  arc  is  a  terminal  arc,  then  an  extension  to  the  path  can  be  made  readily.  To 
do  this,  JOIN  constructs  a  new  extended  PPT  and  turns  this  over  recursively  for 
further  extension  (see  Fig.  V-48). 

SENT  SENT  SENT  SENT 

I  —  /  \  —  /  i  \  —  ^7'  W  — 

NP  NP  be  NP  be  verb  NP  be  verb  off!x 

A  A  I  All  A  I  I  I 

?  i  i  iii 


(b)  If  the  arc  is  a  nonterminal,  however,  the  algorithm  cannot  continue  to  extend  the 
path  in  the  current  net,  unless  the  nonterminal  itself  can  be  completed  with,  at 
most,  one  content  word.  There  are  two  such  nonterminals  in  LPARS1  grammar: 
NP,  which  can  be  completed  by  (DET  NOUN);  /nd  PP,  which  can  be  completed  by 
(PREP  DET  NOUN)  where  the  preposition  is  "in"  or  "on".  JOIN  therefore  treats 
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these  nonterminals  like  terminals  when  extending  a  path  and  inserts  the  appropriate 
structures  into  the  hypothesized  extended  PPT  isee  Fig.  V-49). 
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Fig.  V-49. 


The  algorithm  can  continue  recursively  to  extend  a  path  until  a  state  with  no  permitted  exits 
is  reached,  or  unti'  it  reaches  a  state  from  which  it  could  only  continue  by  hypothesizing  twc 
content  words. 


(2)  Starting  a  Proposed  Path  rrom  Scratch 

The  JOIN  algorithm  can  also  start  up  a  hypothesized  path  from  scratch  in  a  given  transition 
neti  tk  ir  a  fashion  very  similar  to  tha*  inscribed  above.  It  merely  takes  each  initial  state  of 
the  t  N  and  looks  at  the  arcs  emanating  .  ,  m  it  and  proceeds  as  above. 


(3)  Continuing  an  Extended  Path  in  a  Lower  Net 


Arc'.'.'.er  possible  way  to  extend  a  path  involves  pushing  to  a  lower  net,  and  continuing  the 
extension  of  the  path  in  t..at  lower  net.  This  involves  a  ocmbination  tFig.  V-50)  of  the  techniques 
described  in  the  previous  two  sections. 
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Fig.v-ro. 

(4)  Joining  ;.n  LPPT  Path  to  toe  RPPT 

There  are  three  fundamental  ways  (cal*ed  Modi  s  1 ,  2,  and  3)  in  which  an  LPPT  with  an  ex¬ 
tended  path  can  he  joined  *o  an  HPPT. 

Mode  1 : -  If  the  end  state  of  the  LPPT  equals  the  start  state  of  the  HPPT. 

In  this  ase,  the  two  paths  can  be  directly  concatonnte-d,  assuming  that  all  exit  action  routines 
are  compatible  (which  can  be  checked  since  each  node  includes  a  list  of  arcs  taken  by  its  path) 

(see  F lg.  V-St ). 
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Fig.  V -51. 
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The  algorithm  must  then  ascertain  that  the  inner  nodes  begin  and  end  on  permitted  states 
This  is  done  in  LPARS  by  left-to-right  extension  of  ,iaths.  Thus,  a  node  is  made  to  end  on  a 
final  state  b>  following  arcs  from  the  er.d  state  of  its  path.  Conversely,  a  node  is  n.ade  to  begin 
on  an  initial  state  by  initiating  a  path  at  ea^h  possible  initial  state  and  attempting  to  extend  this 
path  to  join  the  node. 

Mode  2:-  If  an  exit  arc  from  the  endstate  of  the  LPPT  invokes  the  RPPT  (Fig.  V -521.  Again, 
the  algorithm  must  still  make  the  inner  nodes  begin  and  end  on  permitted  states. 
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Fig.  V-52. 

Mode  3:-  If  the  RPPT  has  an  ancestor  link  son,  and  the  end  state  of  the  LPPT  equals  the  start 
state  of  the  RPPT  (Fig.  V-  i).  This  example  is  somewhat  harder  to  visualize  since,  although 
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Fig.  V-53. 

the  two  nodes  have  been  joined,  additional  processing  must  be  done  to  complete  the  connection. 

In  the  example  above,  one  must  initiate  a  path  in  the  PP  net  to  be  joined  to  the  ancestor  link  son, 
thus  generating  the  following  possible  structu-e  (Fig.  Y-54)  and  creating  the  following  entire 
structure  (Fig.V-55). 
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Fig.  \  -54.  Fig.  V-55. 

(5)  Overall  Control  of  CONNECT 

The  preceding  sections  attempted  to  give  a  sound  verbal  overview  of  the  logic  involved  in 
the  CONNECT  routine.  The  following  program  outline  describes  the  overall  control  program 
whi'h  coordinates  the  CONNECT  algorithm  so  that,  given  two  PPTs,  it  will  find  all  possible 
in  -rvening  word-strings. 

Thr  only  part  of  CONNECT  which  it  does  not  include  is  the  popping  to  higher-level  contexts 
with  the  LPPT,  an-*  the  application  of  this  algorithm  to  the  PPT  resulting  from  the  pop. 


BEGIN  CONNECT* L.BPT,  RPPT) 

(1)  If  I  PPT  is  NIL,  or  RPPT  is  Nil.  return. 

(2)  If  1  PPT  can  be  directly  joined  (with  no  extension  o*'  the  path)  to  RPPT  b> 

Mode  1,  and  if  ANCESTOR(llPPT)  equals  true  then  call  CONNECT* R.SONI  1  PPT), 
1  SON*  RPPT)). 

(fomment:  This  is  the  step  that  handles  the  zipping  together  of  higher-level 
structure.  As  discussed  in  Sec.  D-3-a,  before  CONNECT  is  called  recursively, 
a  merged  structure  is  constructed  from  the  current  nodes,  containing  s  tem¬ 
porarily  empty  sen.  and  the  address  of  this  son  is  available  to  the  lower  invr 
cation  of  CONNECT.) 

(3)  Call  J01N(L.PPT,  RPPT). 

(4)  If  this  is  the  top  level  of  CONNECT  or  if  FINAI.(l.PPT)  equals  true,  then  call 
CONNECT(RSON( L.PPT ),  RPPT). 

END 

ANCESTO  ’*N)  returns  true  if  the  leftmost  son  of  node  N  is  an  ancestor  link  son. 

RSON(N)  returns  a  pointer  to  the  rightmost  son  of  node  N,  or  Nil.  if  N  is  terminal. 

I.SON(N)  returns  a  pointer  to  the  leftmost  son  of  node  N.  or  Nil.  if  N  is  terminal 

FINAl.(N)  returns  true  if  node  N  ends  on  a  permitted  final  state. 

JOIN(L,  R):  The  JOIN  routine  performs  the  path  extension  and  joining  described  in  the  pre¬ 
vious  sections.  It  also  t-xes  care  of  making  sure  innermost  nodes  begin  and  end  on  permitted 
states.  If  there  is  a  higher  level  of  merged  structure,  it  places  a  pointer  to  any  current  "joined" 
PPT  in*o  the  "empty"  son  cell  of  that  higher  structure.  Finally,  it  calis  the  CONSTRAIN  routine 
whenever  any  combined  PPT  is  constructed. 

(6)  Attempting  All  Possible  Connections 

As  described  above,  the  CONNECT  algorithm  automatically  attempts  to  connect  the  RPPT 
to  lower  nodes  of  the  LPPT  as  well  as  just  to  the  top  node  (by  dropping  down  recursively). 

Clearly,  the  converse  situation  is  also  desired:  namely,  to  attempt  to  join  the  top  node  of  the 
LPPT  to  lr  wer  nodes  of  the  RPPT  (for  instance,  see  Fig.  V-56). 
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Fig.  V-56. 

To  allow  this  to  happen,  CONNECT  is  first  called  with  the  top  nodes  of  the  two  PPTs  as 
arguments,  and  then  drops  down  to  successive  inner  sons  of  the  RPPT  and  calls  JOIN  with  each 
of  those  nodes,  and  the  L.PPT  as  arguments.  When  attempting  these  connections,  the  algorithm 
does  not  allow  the  top  level  of  the  RPPT  to  drop  down  recursively. 

The  only  other  possible  connections  which  must  be  attempted  are  those  made  possible  by 
popping  with  the  LPPT  (see  Fig.  V-S7).  The  only  difference  between  this  popping  and  the  popping 
connections  discussed  in  the  TREE  algorithm  is  that,  in  this  case,  the  PPT  being  popped  need 
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Fig.  V-57. 

not  end  on  a  final  state.  It  is  only  necessary  that  it  can  be  made  to  end  or  a  final  state  by  pro¬ 
posing  a  string  of  words  containing,  at  most,  one  content  word. 

Thus,  the  CONNECT  algorithm  tries  all  possible  pops  it  can  make  with  the  LPPT,  and  at¬ 
tempts  to  join  each  resulting  structure  to  the  RPPT. 


5.  CONSTRAIN:  Compiling  Additional  Constraints 

The  part  of  the  CONNECT  algorithm  described  so  far  connects  two  PPTs  by  proposing  words 
that  might  exist  in  betv  -‘en,  but  only  determines  the  parts  of  speech  of  the  words  proposed.  From 
examining  the  context  of  these  hypothesized  wo-ds  (i.e.,  the  entire  combined  PPT),  one  can  de¬ 
duce  additional  constraints  as  to  what  the  words  might  be.  The  information  needed  for  deducing 
these  additional  constraints  is  contained  in  the  conditions  which  augment  the  arcs  in  the  grammar. 

An  example  is  probably  useful  to  highlight  exactly  what  ihe  problem  here  is.  Suppose  the 
two  PPTs  shown  in  Fig.  V-58  were  joined  by  hypothesizing  a  verb  between  them. 
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Fig.  V-58. 


A  person  looking  a'  this  structure  would  know  that  not  all  verbs  could  exist  in  this  particular 
context.  The  verb  must  be  transitive;  it  must  not  require  an  animate  subject,  for  instance;  and 
it  must  be  something  a  table  can  do  to  a  book.  In  the  normal  operation  of  the  transition  net  gram¬ 
mar,  the  tests  that  enforce  these  constraints  are  contained  in  the  conditions  associated  with  the 
arcs  in  the  grammar. 

Thus,  one  possible  way  to  determine  what  verb  could  fit  into  this  context  is  to  take  each  verb 
in  turn,  and  reparse  the  sentence  with  that  verb  and  see  if  the  parsing  succeeds.  This  is  a  very 
time-consuming  and  inelegant  approach.  Somehow  we  want  a  way  of  "turning  the  constraints 
around"  and  have  them  tell  us  what  they  want,  rather  than  just  telling  whether  a  particular  string 
of  words  is  acceptable  or  not. 

In  other  words,  the  constraints  must  actively  suggest  action  rather  than  merely  answer 
yes-no  questions.  This,  in  fact  is  exactly  what  LPARS  allows.  The  constraint  tests  in  the 
arcs  are  written  so  that  they  can  be  run  in  two  modes: 
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(1)  In  one  mode,  the  tests  are  given  two  lists  of  information  representing  features 
associated  with  two  structures  which  are  to  be  checked  for  consistency.  The  test 
will  return  NIL  if  the  arc  cannot  be  traversed,  or  else  return  TRUE. 

(2)  In  the  other  mode,  the  tests  are  passed  one  feature  list  and  the  atom  "QUESTION" 
in  place  of  the  other  feature  list,  indicating  that  LPARS  wants  the  test  to  tell  it 
what  constraints  must  be  satisfied,  and  then  to  return  TRUE  just  as  if  those  condi¬ 
tions  had  indeed  been  met.  Each  such  constraint  which  must  be  met  is  appended 

to  a  global  list. 

(To  allow  constraints  to  be  placed  on  a  string  of  words,  the  system  actually  uses 
a  set  of  dist'nct  QUESTION  atoms  (Qt,Q2,Q3,Q4, .  .  .)  depending  on  the  position  of 
the  word  in  the  proposed  word  string.) 

When  a  combined  PPT  has  been  tound  and  passed  to  the  CONSTRAIN  algorithm,  a  QUESTION 
atom  is  associated  with  each  word  which  has  been  hypothesized.  After  processing  the  combined 
PPT,  CONNECT  then  has  a  list  of  additional  constraints  which  it  can  use  to  search  its  dictionary 
quickly  to  come  up  with  a  list  of  words  which  would  fit. 

This  ''turning  around  of  the  constraints"  allows  the  algorithm  to  process  the  PPT  only  once, 
rather  than  once  for  each  word  of  the  proposed  part  of  speech. 

A  simp)"  example  is  probably  useful  at  this  point.  Suppose  we  have  the  situation  described 
above:  a  verb  has  been  hypothesized  between  the  two  noun  phrases  "the  table"  and  "the  book". 

Mode  1 :  Normally,  when  parsing  a  sentence,  the  algorithm  would  do  the  following. 

(1)  When  parsing  "the  table",  it  associates  the  semantic  information  describing  this 
noun  phrase  with  a  marker  called  SSUB  in  the  list  of  information  associated  with 
the  SENT  mode. 

(2)  When  parsing  the  verb,  it  assures  subject-verb  agreement  between  the  information 
tagged  SSUB  and  the  information  associated  with  tit  verb.  For  instance,  if  the  verb 
required  an  inanimate  subject,  the  test  assures  that  the  subject  had  been  animate. 

(3)  When  parsing  the  second  NP,  it  similarly  assures  whatever  agreement  was 
appropriate. 

(4)  When  making  the  final  state  test,  it  makes  sure  that  the  verb  was  transitive. 

Mode  2:  Since  we  do  not  know  what  the  verb  is,  we  construct  the  PPT  given  in  Fig.  V-59. 
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Fig.  V-59. 

This  PPT  is  then  subjected  to  the  following  parsing  process: 

(1)  When  parsing  the  first  NP,  the  parser  behaves  just  as  in  Mode  1  since  nothing  is 
changed. 
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(2)  When  parsing  the  verb,  Mode  2  is  activated.  The  algoiithm  must  determine  what 
constraints  are  to  bo  satisfied.  Since  the  •  ubject  is  not  animate,  the  constraint 
(AN1M  MINUS)  is  added  to  the  global  list  of  constraints  associated  with  the  QUESTION 
atom  Ql,  indicating  that  the  verb  must  not  be  one  which  requires  an  animate  subject. 

In  this  case,  the  constraint  (CONT  MINUS)  can  alsc  be  added,  indicating  that  the 
verb  could  not  be  a  "containing"  verb  such  as  "contain"  or  "hold"  since  "table"  does 
not  make  sense  as  the  subject  of  such  a  vprb.  A  new  list  of  information  to  be  as¬ 
sociated  with  the  SENT  node  is  constructed,  with  a  QUESTION  atom  (Ql)  being 
associated  with  the  tag  VERB. 

(3)  When  parsing  the  second  NP,  additional  constraints  might  be  found  in  much  the 
same  fashion. 

(4)  When  performing  the  final  state  test,  the  test  which  assures  that  the  verb  is  transi¬ 
tive  sees  the  atom  QUESTION  associated  with  VERB,  and  would  therefore  add  the 
constraint  (VERB  TRANS)  to  the  list. 

The  net  effect  of  this  activity  is  to  create  the  following  list  of  constraints  ((AN1M  MINUS) 
(CONT  MINUS)  (VERB  TRANS)).  A  list  of  verbs  that  match  these  specifications  is  made,  and 
tested  against  the  input  Any  which  succeed  are  incorporated  into  new  PPTs  and  added  to  the 
list  of  PPTs  which  LPARS  is  using. 
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VI.  OVERALL  DESIGN  OF  THE  LPARS  SYSTEM 


In  this  section,  we  review  the  whole  of  the  LPARS  system  and  discuss  how  the  different  parts 
fit  together.  We  then  describe  the  fallback  (or  backup)  mechanism  built  into  LPARS.  Finally, 
some  inter  ‘ir  '  issues  involved  in  coordinating  the  virious  subparts  of  the  system  are  discussed 

A.  OVERVIEW  OF  THE  LPARS  SYSTEM 

Figure  VI- 1  outlines  the  basic  flow  of  control  in  LPARS,  this  involves  4  basic  steps 
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1.  Initial  Scan 

The  first  step  is  the  initial  scan  made  at  low  phonetic  distance  through  the  entire  utterance 
looking  for  lairl.v  long  words  that  are  not  too  garbled. 

2.  LOCA LMATCH  Routine 

The  second  step  is  the  local  higher-distance  matching,  searching  for  strings  of  small  words 
and  for  more  highly  garbled  longer  words,  in  the  areas  beside  and  between  the  words  found  in  the 
initial  scan.  This  matching  is  done  based  on  very  local  criteria,  with  no  attempt  to  enforce  anv 
global  coordination  between  processing  going  on  in  different  parts  of  the  utterance. 

3.  TREE  Algorithm 

All  words  found  b>  the  initial  scan  and  the  l.tX’ALMATCII  routine  are  input  to  the  TREE 
algorithm  which  constructs  as  many  different  syntactic  structures  as  it  can,  in  all  parts  of  the 
utterance,  using  these  words,  h  ai  the  words  of  the  sentence  have  been  found,  then  TREE  will 
construct  the  entire  sentence. 
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4.  CONNECT  Algorithm 

These  PPTs  are  then  tuned  over  to  the  CONNECT  algorithm  which  attempts  to  join  together 
pairs  of  the  parse  structures  by  proposing  word -strings  which  might  exist  between  them,  and 
testing  these  word-strings  out  at  high  phonetic  distance  against  the  input.  Lf  any  word-string 
matches  successfully  a  new  combined  parse  structure  is  created. 

B  TWO  HEURISTICS  USED  BY  I. PARS 

Two  heuristics  are  used  to  augment  the  basic  system  described  above. 

One  heuristic  aids  the  initial  local  word  recognition  process,  if  the  set  of  word  candidates 
output  by  the  initial  scan  and  the  LOCALMATCII  routine  leaves  substantial  gaps  in  the  utterance, 
then  the  system  reinitiates  ioca)  word  recognition  in  those  gaps.  It  rescans  the  gaps  at  a  higher 
phv  letic  distance,  and  processes  any  words  found  using  the  l.OCAl. MATCH  routine.  This  is  a 
simple  heuristic,  which  uses  very  local  information  to  direct  higher-distance  scans  in  areas 
where  further  attention  is  obviously  needeu  In  practice,  it  is  not  invoked  verv  often. 

The  second  heuristic  helps  the  CONNECT  algorithm  to  compensate  for  any  partially  errone¬ 
ous  structures  which  may  have  been  built  by  the  TREE  algorithm.  CONNECT  will  only  accept 
as  input  l’PTs  which  do  not  overlap,  and  which  are  within  some  maximum  number  of  phonemes 
of  one  another.  However,  the  TREE  algorithm  mav  have  attached  an  erroneous  word  onto  an 
otherwise  correct  structure.  To  compensate  for  this  possibility,  the  system  has  tv/o  functions 
(RIP-L  and  RiP-R)  which  rip  leftmost  and  rightmost  words  off  a  PPT,  thus  obtaining  smaller 
subsumed  structures. 

By  using  these  functions,  the  system  can  take  two  partially  overlapping  I'PTs  and  rip  words 
off  either  or  both,  to  obtain  two  smaller  nonoverlapping  structures  which  can  be  turned  over  to 
CONNECT.  Also,  even  if  the  PPTs  do  not  overlap,  the  system  can  rip  words  off  either  or  both 
so  long  as  the  number  of  phonemes  between  them  does  not  exceed  the  maximum  permissible, 
and  turn  the  smaller  PPTs  over  to  CONNECT. 

The  two  heuristics  deser  bed  above  are  useful  tricks  which  help  the  system  work  more  suc¬ 
cessfully.  They  do  not,  however,  make  anv  pretense  of  providing  a  systematic  means  of  finding 
the  correct  sentence,  if  processing  by  the  whole  of  I. PARS  fails. 

C.  SYSTEMATIC  KALI  BACK 

We  now  describe  a  systematic  method  for  fallback  available  to  I, PARS  if  the  initial  attempt 
at  sentence  recognition  fails.  This  fallback  method  is  exhaustive,  and  will  eventually  find  any 
sentence  (although  it  might  well  find  erroneous  sentences  first).  In  concept,  the  fallback  consists 
of  reactivating  the  entire  I  PARS  system  at  a  higher  set  of  phonetic  distance  thresholds 

Three  significant  phonetic  distance  thresholds  used  by  LPARS  are  those  used  by  (1)  the  ini¬ 
tial  scan,  (E)  the  LOCALALATCH  routine,  and  (3!  the  CONNECT  algorithm. 

These  thresnolds  arc  initially  set  at  levels  which  yield  reasonable  success  rates  without 
exhaustive  amounts  of  computation  However,  if  the  svstem  fails  to  find  a  sentence  using  the 
standard  levels  for  these  thresholds,  the  system  can  raise  the  thresholds  and  try  again.  Further¬ 
more,  it  can  do  this  without  having  to  repeat  most  of  the  work  already  done  (as  discussed  below 
in  Sec.  1).  In  fact,  1  PARS  can  continue  this  f  .llback  process  for  several  iterations  until  a  sen¬ 
tence  is  fc  Hid. 

In  theory,  of  course,  continuing  the  iterative  fallback  process  to  absurdly  high-distance 
thresholds  would  result  in  the  svstem  generating  every  possible  acceptable  sentence  that  it  could 
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construct  with  its  vocabulary,  since  eventually,  at  a  high  enough  threshold,  every  w ord  would 
match  everywhere  in  the  utterance.  In  practice,  one  must  deeide  upon  some  maximum  threshold 
level,  after  which  one  acknowledges  failure. 

The  remainder  of  this  section  discusses  how  one  might  allow  the  fallbaek  to  higher-distanee 
thresholds  to  take  place  without  requiring  that  all  the  previous  work  be  repeated. 

It  is  easy  to  achieve  this  goal  in  the  TREE  algorithm.  The  additional  words  found  by  the 
higher-threshold  initial  processing  are  given  to  TREE  together  with  the  old  words  and  the  pho¬ 
neme  position  table  containing  the  previously  constructed  PPTs.  TREE  ear  proeeed  with  the  pro¬ 
vision  that  now  each  syntactic  connection  must  be  made  either  between  any  PPT  and  a  new  word, 
or  between  a  PPT  conta  \ing  a  new  word  and  any  word.  In  other  words,  one  does  not  attempt  to 
connect  any  of  the  old  PPTs  to  any  of  the  old  words.  This  restriction  allows  TREE  to  find  all  pos¬ 
sible  syntactic  strueiures.  yet  the  work  done  in  the  previous  TREE  processing  remain  intact 

To  allow  fallbaek  to  higher-distance  thresholds  without  repeating  the  lexical  work  done  pre¬ 
viously  is  simple  in  concept,  but  costly  in  storage.  It  requires  that  eaeh  partial  lexical  match 
which  was  aborted,  because  it  exceeded  the  previous  threshold,  be  recorded  so  that  it  can  be 
restarted.  This  involves  a  great  deal  of  storage  because  the  initial  scan,  for  instance,  matches 
most  words  in  the  vocabulary  against  every  phoneme  position  in  the  sentenee.  Eaeh  one  of  these 
matching  attempts  usually  involves  several  substitution-deletion  combinations. 

Thus,  if  there  were  50  words,  50  phonemes  in  the  utterance,  an  average  of  10  partial  matches 
per  recognition  attempt,  and  if  two  words  were  required  to  reeorj  each  aborted  attempt,  then 
50,000  words  of  storage  would  be  required  tall  to  process  one  sentenee).  On  the  other  hand, 
perhaps  this  is  reasonable  if  one  considers  that  it  represents  the  state  of  25,000  parallel,  in¬ 
dependent  processes  that  have  been  temporarily  suspended.  l.PARS  at  present  does  not  imple¬ 
ment  this  approach,  but  rather  recomputes  all  the  lexical  matching  at  the  higher-distanee  thresh¬ 
olds  if  it  is  forced  to  use  this  fallback. 

D.  DESIGN  CONSIDERATIONS  I  OR  THE  t.PAKS  Si. STEM 

Sections  II  ‘hrough  V  discussed  the  various  sub-parts  of  the  l.PARS  system:  the  injiial  sear., 
the  LOCALMATCII  routine,  and  the  TREE  and  CONNECT  algorithms.  These  are  the  tools  which 
the  l.PARS  system  uses  to  recognize  an  utterance.  But  just  building  these  tools  is  not  enough. 

One  must  understand  what  the  various  tools  are  useful  for,  what  they  are  not  useful  for,  and  then 
design  the  svstem  as  a  whole  around  this  understanding. 

Therefore,  in  this  section  wc  examine  the  tools  which  have  been  developed  to  see  how  they 
are  best  used. 

1.  Two  Modes  of  Joining  PPTs 

Consider  the  task  of  examining  two  PPTs  developed  from  an  input  utterance,  together  with 
an  intervening  input  segment,  to  construct  a  combined  PPT.  There  are  two  fundamentally  dif¬ 
ferent  ways  to  proceed.  The  alternatives  couid  be  called  the  "l>ottom-up"  (or  "da'a -directed" ) 
approach  vs  the  "top-dowr."  (or  "grammar-directed")  approach. 

a.  Ilottom-l  p  (HU)  Connection 

In  HE  (or  data -directed)  connection,  the  system  s^ans  the  segment  hetween  the  *wo  PPTs  at 
a  high  phooe«.ic  distance,  and  then  tries  to  construct  a  combined  PPT  from  the  two  input  PPTs 
together  with  any  words  found  In  this  mode  of  connection,  the  previous  PPTs  are  used  to 
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indicate  a  section  of  input  to  be  examined  at  high  phonetic  distance,  but  not  to  indicate  explicitly 
what  word-strings  might  be  expected  to  occu-  there. 

b  Top-Down  Connection 

Top-down  (or  grammar-directed)  connection  is  that  performed  by  the  CONNECT  "routine 
described  in  Sec  V.  Here,  the  grammar  is  used  to  indicate  what  word-strings  might  possibly 
exist  in  the  gap,  and  only  these  specific  word-strings  are  tested  against  the  input. 

A  number  of  issues  might  be  discussed  to  evaluate  the  relative  merits  of  these  two  strate¬ 
gies.  Here,  we  look  at  only  one  of  these  issues,  namely,  how  the  two  strategies  perform  as  a 
function  of  the  amount  of  "context"  contained  in  the  two  PPTs.  To  illustrate  this  issue,  consider 
the  following  two  examples. 

Example  1 

Suppose  I.PARS  is  attempting  to  connect  the  two  PPTs  shown  in  Fig  VI-2 
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Fig.  Vi-2. 

Using  the  top-down,  grammar-directed  approach,  1-PARS  discovers  that  only  one  possible 
word-string  can  fill  this  gap,  namely  a  noun.  (Remember  that  only  word-strings  containing,  at 
most,  one  content  word  are  considered  )  Therefore,  it  compiles  a  list  of  appropriate  nouns  and 
tests  them  against  the  input  to  see  if  any  match. 

Example  2 

Suppose,  on  the  other  hand,  I.PARS  is  attempting  to  connect  the  two  PPTs  shown  in  Fig.  VI-3 
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Fig.  VI -  3  . 


Using  the  top-down,  grammar-directed  approach,  I.PARS  discovers  (even  with  its  very  simple 
grammar!  that  there  is  a  -cry  large  number  of  possible  word-string  connections;  these  include 
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(and  many  more). 

»s  a  result,  using  the  top-down  algorithm  with  these  two  PPTs  is  not  very  efficient  Doing 
so  will  exercise  almost  the  entire  grammar,  and  generate  almost  every  word-string  conceivable, 
each  of  which  must  be  matched  against  the  input  at  high  phonetic  distance.  Clearly,  this  approach 
is  not  very  selective  in  bringing  context  io  our  aid 

On  the  other  hand,  is  this  surprising?  After  ali,  a  noun  phrase  is  a  very  basic  b  ilding 
block  of  an  English  sentence,  it  can  occur  in  many  contexts,  and  is  structurally  the  same  in  each 
context.  Therefore,  it  is  reasonable  that  two  such  primitive  structures  can  be  connected  many 
ways. 

What  then  does  this  imply? 

(1)  Need  for  "Sufficient  Context" 

One  implication  is  that  we  should  not  use  the  top-down  CONNECT  algorithm  unless  the  PPTs 
involved  provide  "sufficient  context,"  in  some  sense,  to  make  the  word-string  possibilities  quite 
selective.  This  restriction  is  not  surprising  when  we  consider  that  the  CONNECT  algorithm  is 
a  very  powerful  tool;  given  any  two  parse  structures  conceivable,  it  processes  the  grammar  in 
a  very  rigorous  and  systematic  fashion  to  investigate  every  possible  way  that  they  might  be  con¬ 
nected.  The  price  paid  for  this  power  and  generality  is  that  we  must  be  selective  about  the  prob¬ 
lems  we  ask  the  routine  to  solve. 

This,  in  fact,  is  why  we  have  the  l.OCALMATCH  routine  As  currently  implemented  in 
LPARS,  the  l.OCALMATCH  routine  works  on  very  local  criteria  (words  and  word  pairs)  to  in¬ 
vestigate  the  areas  around  them.  (In  an  extension  to  the  LPARS  system,  this  routine  might  in¬ 
vestigate  somewhat  more-compiex  structures,  particularly  in  searching  for  particular  word- 
strings  which  are  semantically  quite  likely  to  be  present.) 

Moreover,  the  L.OCALMATCH  routine  investigates  around  and  between  fairly  simple  local 
constructs  in  a  way  that  avoids  a  very  systematic,  costly  processing  of  the  grammar.  Also,  it 
can  be  primed  to  look  primarily  for  highly  likely  connections.  Thus,  the  L.OCAI. MATCH  routine 
cuts  corners  (by  using  simple  tests  and  heuristics).  It  can  do  so  because  its  goal  is  not  neces¬ 
sarily  to  uncover  the  entire  utterance,  merely  substantial  portions  of  it  Once  these  portions 
are  uncovered,  the  full  power  and  generality  of  the  CONNECT  algorithm  is  standing  by  to  com¬ 
plete  the  job. 


(2)  Implication  for  Scan  Threshold  Levels 

A  second,  related  implication  is  that,  to  use  the  CONNECT  routine  efficiently,  we  must  tune 
the  system  so  that  the  PPTs  constructed  after  the  initial-local  processing  usually  have  sufficient 
context  to  be  input  to  that  routine.  Only  when  this  does  not  happen,  does  the  BU  connect  routine 
need  be  called,  or  the  more-general  fallback  be  invoked. 
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In  other  words,  to  use  the  CONNECT  routine  effectively,  the  phonetic  distance  thresholds 
of  the  initial  scan  and  LOCAL  MATCH  routine  must  be  set  so  that  a  high  percentage  of  the  cor¬ 
rect  words  in  the  utterance  is  usually  found. 

2.  Vocabulary  Density 

A  second  important  concept  that  enters  into  this  discussion  is  the  somewhat  vague  term 
"vocabulary  density."  Let  us  taky  two  examples  to  see  what  vocabulary  density  might  mean. 

la)  In  a  soarse  (non-dense)  vocabulary,  on'  might  set  the  local  scan  thresholds  to  un¬ 
cover  on  the  a  erage  85  percent  of  the  correct  words  in  the  sentence,  and  generally 
only  have  a  few  incorrect  words  found. 

(b)  In  a  den.  er  vocabulary,  when  one  sets  the  local  scan  thresholds  so  that  85  percent 
of  the  correct  words  are  found,  a  large  number  of  incorrect  words  are  usually 
found  as  well. 

This  is  certaiiily  a  highly  specific  concept  of  vocabulary  density,  but  a  useful  one  for  our 
purposes. 

a.  Factors  Influencing  Density 

There  are  clearly  a  number  of  factors  influencing  vocabulary  density;  these  includ< 

(1)  vocabulary  size 

(2)  front-end  accuracy 

(3)  average  size  of  word 

(4)  amount  of  semantic  selectivity  which  can  be  brought  to  bear  in  the  initial  local 
processing 

(5)  utilization  of  supra- segmental  information. 

b.  Embedding  of  Correct  Sentence  Fragments 

Vocabulary  density  affects  the  two  connection  strategies  discussed  above  because  it  influences 
the  degree  to  which  correct  sentence  fragments  are  likely  to  be  embedded  in  larger  structures 
which  include  some  of  the  erroneous  words. 

The  more  erroneous  words  discovered,  the  more  likely  it  is  that  some  of  these  words  will 
become  attached  by  the  TREE  algorithm  to  a  string  of  correct  words.  Of  course,  if  all  the  cor¬ 
rect  words  are  found,  or  if  the  erroneous  word  overlaps  with  a  correct  word  which  has  been 
found,  no  harm  will  be  done,  in  general,  however,  the  presence  of  incorrect  words  attached  to 
correct  sentence  fragments  makes  the  job  of  isolating  the  correct  sentence  fragments  difficult, 
and  therefore  complicates  the  job  of  finding  the  appropriate  input  segments  to  be  investigated 
next. 


c.  How  CONNECT  Deals  with  Embedding 

The  top-down  CONNECT  algorithm  can  deal  with  this  embedding  problem  by  using  the  RIP-R 
and  RIP-L  routines  described  in  Sec.  B  above.  In  so  doing,  the  system  postulates  that  certain 
fragments  are  correct  embedded  fragments,  and  tries  to  connect  them.  The  denser  th«  vocab¬ 
ulary,  the  more  such  trials  the  algorithm  would  have  to  make  with  a  given  set  of  PPTs.  The 
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degree  to  which  this  approach  is  practical  depends  on  an  evaluation  of  the  work  involved  in  postu¬ 
lating  embedded  fragments  and  in  processing  them  with  the  CONNKCT  algorithm. 

The  BU  algorithm  has  potentially  even  more  difficulty  working  in  the  presence  of  a  great 
deal  of  embedding.  This  difficulty  arises  because,  in  investigating  a  set  of  PPTs  for  the  c  on- 
nection  of  embedded  fragments,  one  might  investigate  a  number  of  different  input  segments  (per¬ 
haps,  in  total,  consuming  a  substantial  par*,  of  the  input).  With  TL)  connection,  during  such  in¬ 
vestigation  a  great  deal  of  selectivity  is  brought  to  bear,  as  far  as  the  word-strings  looked  for  in 
different  places.  With  the  Bl’  algorithm,  however,  such  selectivity  is  impossible.  If  a  large  por¬ 
tion  of  the  input  is  involved,  one  probably  might  just  as  well  rescan  the  v/holt  input  at  a  high  pho¬ 
netic  distance. 

Thus,  the  denser  the  vocabulary,  the  more  difficult  it  is  for  both  algorithms  to  work 
efficiently. 

The  vocabulary  used  in  LPAR5  during  its  testing  was  fairly  sparse,  so  these  problems  did 
not  occur  to  too  large  a  degree.  When  experimenting  with  the  system  with  higher  scrambling 
levels,  and  using  smaller  words,  the  problems  were  observed  and,  since  they  were  interesting 
problems,  this  section  has  tried  to  isolate  and  summarize  them.  The  present  system  has  only 
attempted  to  attack  these  problems  in  a  fairly  limited  fashion. 

3.  c.urrent  I  PARS  Approach 

During  evaluation  of  the  I. PARS  system,  described  in  Sec.  VII,  only  the  grammar-directed 
CONNECT  algorithm  was  used,  and  only  PPTs  containing  "sufficient  context"  were  passed  to  this 
algorithm.  A  PPT  was  deemed  to  have  "sufficient  context"  (a)  if  it  included  a  STRC11  or  an 
ENDCH,  (b)  if  It  was  not  a  single  word,  and  (c)  if  it  was  not  a  NP  or  partial  NP  When  using  a 
more-complex  grammar,  of  course,  these  tests  would  probably  need  to  be  more  sophisticated 

The  phonetic  distance  thresholds  were  set  so  that  most  of  the  correct  words  were  found  after 
the  initial  processing.  Thus,  in  general,  PPTs  containing  sufficient  context  were  built  up  and 
could  be  passeu  to  the  CONNECT  algorithm.  If  not,  the  system  would  full  back  and  iterate  at 
higher  phonetic  distance  thru  .olds. 
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VII.  EXPERIMENTAL  EVALUATION  OF  LPARS,  AND  SAMPLE  RUNS 


This  section  describes  the  experimental  evaluation  of  the  LPARS  system,  and  then  discusses 
a  few  samole  runs  which  illustrate  LPARS1  activity  when  investigating  a  sentence. 

A.  EXPERIMENTAL  EVALUATION 

During  the  e  perimental  evaluation  of  LPARS,  fifty  sentences  were  input  to  the  system  to  be 
reeognized  (the  sentences  used  are  listed  in  Appendix  B).  The  following  steps  were  taken  during 
the  recognition  of  each  sentence. 

1.  Scrambling:-  First,  the  correct  phonetic  spelling  of  the  sentence  is  given  to  the  scram¬ 
bling  program  (the  front-end  simulator)  which  transforms  the  sentence  into  a  string  of  "scram¬ 
bled"  phonemes  containing  a  great  deal  of  error  in  the  form  of  deletion  and  substitution.  The 
statistics  used  in  performing  this  scrambling  are  exactly  those  described  in  Sec.  III.  Sinee  the 
scrambling  is  based  on  random  error  probabilities,  some  sentences  produced  contain  relatively 
small  amounts  of  error,  while  others  contain  a  great  deal. 

2.  Initial  Local  Processing:-  The  string  of  phonemes  is  then  input  to  the  I  PAR?  system 
whieh  performs  its  initial  scan  and  higher-distance  local  matching,  attempting  to  find  most  of 
the  words  in  the  sentence.  The  phonetic  distance  thresholds  of  these  scans  were  set  so  that 
approximately  85  percent  of  the  correct  words  are  usually  found  bv  this  loeal  processing. 

3.  PPT  Construction:-  Next,  the  words  found  by  the  initial  local  processing  are  input  to 
the  TREE  algorithm  which  buiids  as  many  different  parse  structures  as  it  can  in  all  parts  of  the 
utterance  (as  described  in  See.  V-C).  If  one  or  more  entire  sentence  is  found,  then  LPARS  pro¬ 
ceeds  no  further. 

4.  PPT  Connection:-  Next,  pairs  of  PPTs  are  input  to  the  CONNECT  routine  which  attempts 
to  construct  combined  PPTs  as  described  in  Sec.  V-D.  Any  such  combined  PPT  which  is  found 

is  included  in  the  set  of  PPTs  being  investiga.  -d.  if  this  process  discover  one  or  more  possible 
sentences,  then  LPARS  proceeds  no  further. 

5.  Iteration  at  Higher  Thresholds:-  if  the  PPT  connection  is  unsuccessful,  then  LPARS 
raises  the  thresholds  at  which  the  various  scans  arc  made  and  repeats  steps  2,  3,  ar  1  4  again. 
During  the  evaluation,  the  phonetic  distance  thresholds  used  for  the  initial  scan,  the  local  higher- 
distance  matching,  and  the  PPT  connection  were  17,  24,  and  34,  respectively  during  the  first 
iteration;  and  22,  26,  and  39  during  tiie  second  iteration.  If  no  sentence  has  been  found  at  the 
end  of  the  second  iteration,  then  processing  stops  and  failure  is  acknowledged. 

An  attempt  to  recognize  a  given  sentence,  as  outlined  above,  can  result  in  any  of  the  follow¬ 
ing  results. 

(1)  An  entire  sentence  is  constructed  by  the  TREE  algorithm  from  'be  words  found  by 
the  local  processing. 

(2)  A  sentence  is  found  by  the  algorithm  that  connects  Pl-Ts. 

(3)  No  sentence  is  found  at  all. 

Processing  ceases  at  the  end  of  any  stage  in  which  one  or  more  possible  sentences  is  found. 
When  sentences  are  found,  it  may  be  during  either  the  first  or  second  iteration  of  the  system. 

Of  course,  it  may  be  that  none  of  the  sentences  found  is  correct. 
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In  evaluating  LPARS,  if  several  possible  sentences  were  found,  the  recognition  of  the  sen¬ 
tence  was  considered  successful  if  the  correct  sentence  was  includes  among  them.  \\  hen  several 
sentenc  '  were  found,  the  correct  sentence  was  almost  always  the  best  match. 

1  Evaluation 

During  the  actual  eva'uation  of  LI  ARS,  fifty  sentences  were  input  with  ‘he  following  results: 

-3  sentences  were  correctly  recognized  from  v\  -ds  found  only  by  the  local  process¬ 
ing  in  the  first  iteration. 

19  sentences  we  correctly  recognized  by  he  PPT  CONNECTION  algorithm  in  the 
first  iteration. 

I  sentences  were  correctly  recognized  in  the  second  .teration  of  the  system. 

Z  sentences  resulted  in  incorrect  recognition,  lr.  both  ca.es,  the  sentence  found 
differed  from  the  input  sentence  in  a  single  content  word. 

3  sentences  resulted  in  failure  to  find  any  possible  sentence  at  all.  In  two  cases, 
two  .dja  ort  content  words  were  badly  garbled,  and  therefor  the  CONNECT  algo¬ 
rithm  was  unable  to  propose  the  correct  word-string  In  the  third  case,  LP.cRS 
crashed  irretrievable  due  to  a  program  bug. 

Thus,  the  overall  success  rate  of  LPARS  with  the  fifty  input  sentences  was  90  percent. 

2.  Comments  on  L.PARS'  Evaluation 

Tee  LPARS  evaluation  described  aoove  primarily  establishes  that,  g”/en  a  simulation  of  a 
fairly  crude  front  end,  the  ideas  embodied  in  LPARS  can  be  made  to  work  acceptably. 

Certa.nly  there  ait)  a  number  of  interesiing  fu: trier  questions  which  one  might  like  to  ask, 
such  as  how  the  overall  system  performance  is  affected  by  varying  such  parameters  as: 

(a)  scrambling  probabilities 

(b)  scrambling  characteristics 

(c)  vocabulary  size 

(d)  word  size 

(e)  the  phonetic  distance  1<  els  of  the  various  stins. 

It  would  be  very  difficult  to  pursue  such  issues  systematically  with  the  cui  rent  implementa¬ 
tion  if  1  PARS  since  it  is  ve:  r'->w.  Often,  10  to  15  minutes  or  more  of  dedicated  CPC  time  is 
required  per  sentence.  (This  inefficiency  is  not  inherent  in  the  LPARS  design,  but  rather  of  ‘he 
current  implementation.) 

In  any  case,  the  primary  goal  of  the  present  reseal  h  is  on  evel  .ping  an  approach  to  parsing 
which  allows  syntactic  analysis  to  proceed  systematically  from  anywhere  in  a  sentence,  and  to 
understand  the  p-oblems  and  issues  involved  in  such  an  approach. 

l.i  the  present  evalua  :.on  of  lPAR-S,  we  have  described  a  form  of  scrambled  input,  argued 
that  it  has  the  basic  characteristics  of  spoben  input  for  our  present  purposes,  and  then  demon¬ 
strated  t  at  1  pAR,S  can  operate  successfully  given  such  input.  This  success  demonstrates  that 
the  ideas  embedded  in  LPARS  are  fundamentally  sound  and  coherently  formulated,  and  that,  for 
instance,  there  are  no  unforeseen  factor*-  whi'-h  somehow  invalidate  the  approach  taken. 


B.  LPARS  SAMPLE  RUNS 


We  n>w  exhibit  a  number  of  sample  runs  of  the  LPARS  system,  illustrating  various  aspects 
of  the  sysum's  operation.  In  these  examples,  comments  output  by  the  LPARS  system  appej-  in 
capitals.  Additional  comments  in  small  letters  have  been  added  by  the  author  to  make  the  exam¬ 
ples  more  intelligible. 

Example  i 

This  example  shows  a  sentence  which  is  recognized  entirely  from  words  found  by  the  initial 
local  processing. 

(UNSCRAMBLED  SENTENCE) 

(THE  VV ASTEBASKET  WHICH  THE  GREEN  COFFEETABLE  SUPPORT  S 
CONTAIN  S  THE  DICTIONARY) 

(TH  SW  W  AA  S  T  B  E  S  K  SW  T  V  I  CI1  Til  SW  G  R  EE  N  K 
AW  F  EE  T  AA  B  L  S  SW  P  O  R  T  Z  K  SW  N  T  AA  N  Z  TH  SW 
()  1  K  SH  S.V  N  AA  R  EE) 

(SCRAMRI  ED  SENTENCE) 

(T  Y  W  AW  D  P  E  7.H  P  ZH  W  I  CH  Y  L  I  NG  G  AA  B  EE  P  AW 
P  R  H  SW  T11  O  L  P  J  B  SW  U  T  AW  M  II  C  SW  G  TIi  Z  Y  M  AW 
I  EE) 


(INITIAL  SCAN) 

(  (STRCH  0  0)  (SI  ’LADDER  8  15  150)  (COFFEETA  Rt  F  18  25  15) 
(SUPPORT  26  31  5)  (CONTAIN  33  38  10)  (DICTIONARY  42  49  95' 
(ENDCH  50  50)  ) 


The  initial  scan  finds  three  correct  ai.d  one  erroneous  words.  The  words  are  ordered  as 
they  appear  ,n  the  sentence.  (Recall  that  he  three  numbers  associated  with  each  word  candidate 
are  start  phoneme,  end  phoneme,  and  total  phonetic  distance  of  the  match.) 

(LOCAI  HIGHER  DISTANCE  MATCHING) 

(  (STRCI1  0  0)  (CONTAIN  1  5  »)  (T11E  1  2  0)  (PLACE  3  5  90) 

(WASTEBASKET  3  10  185)  (THE  6  7  0)  (STEPI .ADDER  8  15  ISO) 

(V  »1CII  11  12  12)  (CLUTTER  11  14  ,rj)  (THE  13  14  12) 


(THE  15  17  0)  (GREEN  15  17  60)  (COFFEETABL.E  18  25  15) 

(SUPPORT  26  31  5)  (CLUTTER  30  32  110)  (ROBERT  30  33  120) 

(THE  32  33  0)  (THE  33  33  0)  (CONTAIN  33  38  10) 

(BOOKCASE  34  38  110)  (ENJOY  34  38  120)  (THE  39  41  0) 

(PERSON  42  46  105)  (DICTIONARY  42  49  95)  (ENOCH  50  50)  ) 

A  number  of  additional  words  are  found  by  the  higher-distance  matching,  including  all  the 
correct  words  in  the  sentence.  Next,  the  TREE  algorithm  constructs  all  possible  local  partial 
parse  structures  from  the  words  found.  These  trees  are  printed  ou*  below. 

(PARTIAL  PARSE  TREES  SORTED  BY  LENGTH) 

(STRCH  (  (THE  WASTEBASKET  (WHICH  (THE  GREEN  COFFEETABI.E) 

StPPORT  AFX))  CONTAIN  AFX  (THE  DICTIONARY))  ENOCH) 

(*  (*  (<■  (THE  GREEN  COFFEETABI. E)  SUPPORT  AFX) 

CONTAIN  AFX  (THE  DICTIONARY)  )  EN.XfH) 

(.  (4  r  (THE  GREEN  COFFEETABI.  :,  SUPPORT  AFX) 

ENJOY  AFX  (THE  DICTIONARY)  )  ENOCH) 

(STRCH  (  (THE  WASTEBASKET)  CLUTTER  AFX  (THE  COFFEETABI.E)  )  ) 

(*  (*  (T.IE  GREEN  COFFEETABI.E)  SUPPORT  AFX)  ENJOY 
AFX  (THE  PERSON)  ( 

(*  (*  (CLUTTER  A  CONTAIN  AFX  (THE  DICTIONARY)) 

ENOCH) 

(*  (ROBERT  ENJOY  AFX  (THE  DICTIONARY!  )  ENOCH) 

*  <*  ROBERT  ENJOY  AFX  (THE  DICTIONARY)  )  ENOCH) 

(*  I*  (CLUTTER  AFX!  ENJOY  AFX  (THE  DICTIONARY)  )  ENOCH) 

(*  (THE  D'CTIONARY)  ENOCH) 

(STRCH  (CONTAIN  AFX  (TIIE1  )  ) 

(ROBERT  ENJOY  AFX  (THE  PERSON)  ) 

(*  ROBERT  ENJOY  AFX  (THE  PERSON)  ) 

(*  (CM  TTER  AF  ')  ENJOY  AFX  (THE  PERSON)  ) 

(PLACE  AFX  (THE  ST EPI .ADDER)  ) 

(CLUTTER  AFX  (THE  BOOKCASE)  ) 

The  sym'jol  *  means  that  the  following  son  is  an  ancestor  link  son.  The  symbol  AFX  repre¬ 
sents  an  AFFIX  arc  which  has  been  traversed.  (Any  LAMBDA  arcs  traversed  are  not  indicated 


in  the  printout, )  Thus,  for  instance,  the  PPT  (*  (THE  DICTIONARY)  ENDCH)  represents 
the  structure  shown  in  Fig.  VII- i . 


SENT1 


NP 

A 

the  dictionary 


ehdch 


Since  an  entire  s 


Fig.  VII-1. 

ce  (see  Fig.  V 1 1  -  2 )  is  among  these  PPTs,  LPARS  proceeds  no  further. 


(SENTENCES  FOUND) 

(STRCII  (  (THE  WASTEBASKET  (WHICH  (THE  GREEN  COFFEETABLE) 
SUPPORT  AFX)  )  CONTAIN  AFX  (THE  DICTIONARY)  )  ENDCH) 


strcii 


*hich 


support? 


NP 

A 

the  green  cofSetoble 


ertdch 


the  dictionary 


Fig.  V 1 1  -  2 . 


Example  Z 


( U  NSC  RAM BUED  S ENT  ENO  E) 

(THE  COFFEETABLE  NEXTTf  TEE  BOOKCASE  IS  ILLUMINATE  I) 

BY  THE  TABLELAMP) 

(T II  SW  K  AW  F  EE  T  AA  B  i  N  SW  K  S  T  (X)  Til  SW  B  I  K  AA 
S  I  Z  I  I.  OO  M  I  N  AA  'I  THAI  TII  SW  T  AA  B  I  AE  M  PI 

(SCRAMBLED  SENTENCE) 

(SW  C.  AA  V  EF  G  AW  V  K  I  SW  T  Z  P  (X)  SW  DU  U  G  \U  S  .1 
S  I  II  OO  N  I  M  A  A  .T  P  A  EE  F  SY,  I>  A  A  K  •  i  SW) 
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(INITIAL  SCAN) 

(  (STRCII  0  0)  (COFTEETARI  E  2  9  15)  (TABI.ELAMP  6  12  115) 
((.KEEN  8  11  50)  (NEXTTO  lu  15  40)  (B(X)KCASE  J7  21  5) 

(II  I.l  MIN  ATE  24  31  5)  (ENOCH  43  43)  ) 


Here,  four  correct  ard  two  erroneous  words  are  found.  (Note  that  the  word  candidate 
"tablelamp"  is  erroneous  since  it  is  in  the  wrong  place.) 

(LOCAL  HIGHER  DISTANCE  MATCHING) 

(  (STIICII  0  0)  (THE  1  1  0)  (COFFEETABLK  2  9  15) 

(TAI3I.EI .AMP  6  12  115)  (GREEN  8  11  50)  (IS  10  12  40) 

(NEXTTO  10  15  40)  (SCPPOKT  13  17  140)  (THE  16  16  0) 

(BOO*  CASE  17  21  5)  (IS  22  23  0)  (ILLUMINATE  24  31  5) 

(THE  32  34  0)  (BOOKCASE  35  39  115'  (ENOCH  43  43)  ) 

9 

In  this  case,  not  all  the  words  are  found  by  the  initial  local  processing  Therefore,  the 
system  constructs  only  the  partial  structures  shown  below. 

(PARTIAL  PARSE  TREES  SORTED  BY  LENGTH) 

(STRCII  (  (THE  COFFEETABLK  (NEXTTO  (THE  BOOKCASE)  )  )  IS 
ILLUMINATE  AFX)  ) 

(STRCII  (  (THE  COFFEETABLE  (NEXTTO)  )  )  ) 

(This  PPT  is  not  subsumed  by  the  previous  one,  ever,  though  it  might  appear  to  be. 

It  represents  the  PRC  structure  "the  coffeetable  nextto  which.  .  .".) 

(STRCII  (  (THE  COFFEETABLE)  IS)  ) 

(*  (NEXTTO  (THE  BOOKCASE)  )  IS  ILLUMINATE  AFX) 

(*  (THE  B(X)KCASE)  IS  ILLUMINATE  AFX) 

(  (THE  BOOKCASE)  IS  ILLUMINATE  AFX) 

<*  TABI.ELAMP  SUPPORT  AFX) 

ENDCIi 

(IS  SUPPORT  AFX) 

(THE  BOOKCASE) 

GREEN 

The  system  must  now  rse  the  CONNECT  algorithm  to  attempt  to  discover  an  entire  sentence. 


(PPT  CONNECTION) 


(PPT  PAIR) 


(STRCH  (  (THE  COFFEETABLE  (NEXTTO  (THE  BOOKCASE)  )  ) 
IS  ILI  I'M  IN  ATE  AFX)  ) 

ENDCH 


PPTS  TO  BE  CONNECTED 

(STRCH  (  (THE  COFFEETABLE 
IS  ILLUMINATE  AFX)  ) 
ENDCH 


(NEXTTO  (THE  BOOKCASE)  )  ) 


The  system  now  calls  the  CONNECT  algorithm  with  the  two  PPTs  shown  in  Fig.  VII-3. 


NP 

the  coffeetob  e  PP 

/ 

/ 

nextto 


is  illunurjte 


NP 

/\ 


the  bookcase 


endch 


Fig.  VII-3. 


PROPOSED  PPT 

(STRCH  (  (THE  COFFEETABLE  (NEXTTO  (THE  BOOKCASE)  )  ) 
IS  ILLUMINATE  AFX  BY  NP)  ENDCH) 


A  proposed  combined  PPT  is  discovered  (see  Fig.  VII-4) 


strch' 


NP 

A 

the  coffee toble 
next  to  the  bookcose 


Fig.  VII-4. 

Ail  word-strings  which  match  the  hypothesized  portion  of  this  proposed  PPT  are  tested 
against  the  input.  To  help  us.  I.PARS  types  out  one  such  word-string. 


POSSIBLE  WORD  STRING  (D  BY  THE  C HANDEL. IER) 


This  word-string  do<.s,  in  fact,  match. 


WORD  STRING  MATCHED  (  (D  BY  THE  CHANDELIER)  32  42  395) 
COMBINED  PPT 

(STRCH  (  (THE  COFFEETABI  E  (NEXTTO  (THE  BOOKCASE)  )  ) 

IS  ILLUMINATE  D  BY  (THE  CHANDELIER'  )  ENDCII) 


So  do  others.  (Notice  that  only  semantically  reasonable  nouns  are  matched.) 

WORD  STRING  MATCHED  (  (D  BY  THE  FLASHLIGHT)  32  42  415) 

COMBINED  PPT 

(STRCH  (  (THE  COFFEETABI  E  'NEXTTO  (THE  BOOKCASE)  )  ) 

IS  ILLUMINATE  D  BY  (THE  FLASIII  IGIIT)  )  ENDCH) 

WORD  STRING  MATCHED  (  (D  BY  THE  TAHLELAMP)  32  42  225) 

COMBINED  PPT 

(STRCH  (  (THE  COFFEETABI  F  (NEXTTO  (THE  BOOKCASE)  )  ) 

IS  ILLUMINATE  D  BY  (THE  TAHLELAMP)  '  ENDCII' 

Three  possible  word-strings  match  within  the  high  phonetic  distance  of  this  match.  Notice, 
however,  that  the  word-string  containing  "ta'olelamp"  matches  at  the  lowest  phonetic  distance. 

This  example  illustrates  an  interesting  aspe-t  of  LPARS.  Even  though  words  in  the  vocabu¬ 
lary  -e  not  very  close  phonetically,  wher  LPARS  is  performing  matches  at  very  high  phonetic 
distances,  often  several  word-strings  mr.v  match.  This,  of  course,  is  not  too  surprising.  The 
more  garbled  s  section  is,  the  more  difficult  it  would  be  to  determine  which  words  match  there. 

To  deal  with  this  problem,  a  system  would  have  to  be  abi ;  to  do  ore  of  thre  things-  call 
front-end  routines  to  help  choose  between  the  different  possibilities,  so  semantics  to  rule  some 
out,  or  ask  the  speaker  which  he  actually  meant. 

The  advantage  of  a  vertically  organized  system  like  1  PAILS  is  that  it  only  investigates  h 
segments  very  selectively-  when  a  grea*  deal  of  contextual  information  indicates  the  necessity, 
and  when  a  great  deal  of  context  can  be  used  to  help  constrain  the  words  that  might  be  present 

Example  3 


(I  NSC  RAMBLED  SENTENCE) 

(PLACE  THE  GREEN  DICTIONARY  ON  THE  BOOKCASE! 

(P  I  AA  S  Til  SU  G  R  EE  \  1)  1  K  SI!  SW  N  A  A  II  EE  AU  N  TH 
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SW  I!  U  K  A  A  S) 


(SCRAMBLED  SENTENCE) 

(I-  AA  D  M  SVV  B  \V  EE  F  EE  K  H  SW  N  AVV  R  EE  AW  NG  B  Y  K 
U  P  AW  ZH) 

(INITIAL  SCAN) 

(  (STRCII  0  0)  (DICTIONARY  9  M  IS)  (CL'  r.  ER  20  2)  65) 

(BOOKCASE  22  26  5)  (ENDCII  27  27)  ) 


(LOCAL  HIGHER  DISTANCE  MATCHING) 

(  (STP.CH  0  0)  (THE  1  2  0,  (FRIENDLY  3  8  135)  (GREEN  6  8  90) 
(DICTIONARY  9  17  15)  (ON THELEFTOF  18  24  185) 

(ONTOPOF  18  24  120)  (ON  18  19  5)  (IS  18  10  0)  (THE  20  21  5) 
(CLUTTER.  20  23  65)  (BOOKCASE  22  26  5)  (ENDCH  27  27)) 

(PARTIAL  PARSE  TREES  SORTED  BY  LENGTH) 

<*  («  (GREEN  DICTIONARY)  (ON  (THE  ...  DKCASE)  )  )  ENDCII) 

(>  (GREEN  DICTIONARY  (ON  (THE  BOOKCASE)  )  i  ENDC1II 
r  (ON  (TIIF  BOOKCASE)  )  ENDCII) 

(STRCII  (  (THE  UIENDI.Y)  )  ) 

(*  (GREEN  DICTIONARY)  (ONTIIEI. EFTOF)  ) 

(GREEN  DICTIONARY  (ONTI1EI  EFTOF!  I 
(GItEEN  DICTIONARY  (ONTIIEI  EFTOF)  i 
(*  (GREEN  DICTIONARY)  (ONTOPOF)  ) 

(GREEN  DICTIONARY  (ONTOPOF)  ! 

(GREEN  DICTIONARY  (ONTOPOF)  ) 

(*  (GREEN  DICTIONARY)  IS  CLUTTER  AFX) 

(*  (THE  BOOKCASE)  ENIKTD 
(C.REEN  DICTIONARY  (ON.  ) 

(IS  (TIIE  BOOKCASE)  ) 


In  this  example,  when  it  comes  time  to  connect  PPTs,  two  PPTs  partially  overlap,  so  the 
system  rips  words  off  to  try  to  connect  subtrees.  First,  it  rips  "the  friendly"  of  the  left-hand 
PPT,  and  attempts  connection  with  the  two  resulting  structures  (=,"•“  Figs.  VIl-l  and  V 1 1  -  6 ) . 


(PPT  CONNECTION) 

(PPT  PAIR) 

(STRCH  (  (THE  FRIENDLY)  )  ) 
(*  (*  (GREEN  DICTIONARY) 


SENT1 


(CN  (THE  BOOKCASE)  )  )  ENDCH) 


SENT! 


PP 


I  A 

NP  on  the  bookcase 

A 

green  dictionary 
Fig.  V1I-5. 


PPTS  TO  BE  CONNECTED 
STRCH 

(*  <*  (GREEN  DICT’ONARY)  (ON  (THE  BOOKCASE)  )  )  ENDCH) 


s*  ch 


SENTl 

*/  ^ 

SENT 


endch 


NP 

A 

green  dictionary 


PP 

A 

on  the  bookcase 


Fig.  VJI-h. 


PROPOSED  PPT 

(STRCH  <WH  VERB  AFX  (DET  GREEN  DICTIONARY) 
(ON  (THE  BOOKCASE)  )  )  ENDCH) 


POSSIBLE  WORD  STRING  (WHO  PLACE  S  THE) 


The  first  possible  combined  PPT  hypothesized  is  a  question  (Fig.  VI1-7). 


SENT1 


Fig.  VII-?. 


PROPOSED  PPT 

(STRrH  (VERB  AFX  (DET  GREEN  DICTIONARY) 
(ON  (THE  BOOKCASE)  )  )  ENOCH) 

POSSIBLE  WORD  STRING  (PLACE  O  THE) 


WORD  STRING  MATCHED  (  (PLACE  O  THE)  1  5  155) 

COMBINED  PPT 

(STRCH  (PI.ACE  O  (THE  GREEN  DICTIONARY)  (ON  (THE  BOOKCASE)  )  ) 
ENDCH) 


SENTI 


A 


del  green  dictionory 


A 

on  the  bookcose 


Fig.  VII-8. 


The  next  possibility  is  an  imperative  command.  One  of  these  word-strings  matches  and  a  com- 
b’necl  PPT  is  constructed  (Fig.  VII-8).  (The  word  "O"  is  the  root  affix,  the  null  string.) 

(These  examples  stop  when  the  correct  sentence  is  fomd.  The  algorithm  itse'f  is  set  up  to 
continue  to  explore  different  possible  connections.  It  is  qi  ite  striking,  however,  watching  LPARS 
work,  how  quickly  the  system  cat.  often  zero-in  on  promising  connections,  once  given  a  set  of 
PPTs.) 

Example  4 

In  this  example,  the  CONNECT  algorithm  is  called  to  match  word-strings  in  the  middle  of 


an  utterance 


(UNSCRAMBLED  SENTENCE) 

(THE  BROWN  SIDETABLE  BESIDE  THE  FIREPIJVCE  CONCEAL  S  THE 


ARMCHAIR) 

(TH  SW  BRA  OO  N  S  A  I  D  T  AA  B  L  B  EE  SAID  TH  SW 
F  A  I  R  P  L  AA  S  P  SW  N  S  EE  L  Z  TH  SW  A  R  M  CH  AA  R) 


(SCRAMBLED  SENTENCE) 

(I)H  I  R  E  OO  NG  Z  AR  EE  N  AW  V  D  EE  AW  AE  E  TH  Y  P  A  I 
W  P  AW  H  D  SW  W  H  EE  R  H  V  A  SW  M  AW  R) 


(INITIAL  SCAN) 

(  (STRCII  0  0)  (FIREPLACE  20  26  95)  (CONCEAL  27  32  15) 
(ENDCH  40  40)  , 


(LOCAL  HIGHER  DISTANCE  MATCHING) 

(  (STRCH  0  0)  (FIREPI ACE  20  26  95)  (IS  27  28  0) 
(CONCEAL  27  32  15  (CLUTTER  29  St  *20)  (THE  32  34  01 
(THE  33  34  0)  (Ah.MCHAIR  35  39  1251  (ENDCH  40  40)  ) 


No  words  have  been  found  between  phonemes  0  to  20,  so  an  immediate  nighei-distance  scan 
is  made. 

(SPECIAL  SCAN) 

(  (THE  1  2  0)  (BROWN  3  6  55)  (SIDETABLE  7  13  t40)  ) 


PARTIAL  PARSE  TREES  SORTED  BY  LENGTH) 

(  (  FIREPLACE  CONCEAL  AFX  (THE  ARMCHAIR))  ENOCH) 

(STRCII  (  (TI1E  BROWN  SIDETABLE)  )  ) 

(*  (THE  ARMCHAIR)  ENDCH) 

(*  FIREPLACE  IS  CLUTTER  AFX) 
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(PPT  CONNECTION) 


(PPT  PAIR) 

(STRCH  (  (THE  BROWN  SIDETABLE)  )  ) 

(*  '*  FIREPLACE  CONCEAL  AFX  (THE  /  ’MCHAIR)  ) 
ENDCH) 


PPTS  TO  BE  CONNECTED 

(STRCH  (  (THE  BROWN  SIDETABLE)  )  ) 

(*  (*  FIREPLACE  CONCEAL  AFX  (THE  ARMCHAIR)  ) 
ENDCH) 


sirch' 


SENT  I 


NP 

A 

trie  brown  sideloble 


fireplace 


endch 


Fig.  VII-9. 


PROPOSED  PPT 

(STRCH  (  (THE  BROWN  SIDETABLE  (PREP  (DET  FIREPLACE)  )  ) 
CONCEAL.  AFX  (THE  ARMCHAIR)  )  ENDCH) 

POSSIBLE  WORD  STRING  (INFRONTOF  THE) 


SENT1 


siren 


fhe  brown  sidetable  PP 

A 

prep  del  fireploce 


endch 


Fig.  VII-10. 


87 


WORD  STRING  MATCHED  (  (BESIDE  THE)  14  19  200) 

COMBINED  PPT 

(STRCH  (  (THE  BROWN  SIDETABLE  (BESIDE  (THE  FIREPLACE)  )  ) 
LAM  CONCEAL  AFX  (THE  ARMCHAIR)  )  ENDCH) 


Vlll.  CONCLUSIONS,  BASIC  1SSI  ES,  AND  AltLAS  FOR  FURTHER  WORK 


This  section  places  the  present  work  into  br  dec  focus  by  discussing  some  interesting 
issues  brought  to  light.  Most  of  these  issues  relate  to  the  local  nature  or  LPARS'  design.  In  this 
discussion,  we  describe  some  of  the  fundamental  problems  involved  in  designing  a  system  like 
LPARS,  and  how  the  approach  described  solves  these  problems.  Finally,  wc  conclude  by  dis¬ 
cussing  a  number  of  interesting  areas  for  further  work. 

A.  BASIC  ISSUES  BROUGHT  TO  LIGHT 

1.  Advantages  of  a  Local  Approach 

The  most  obvious  issue  concerns  the  relative  merits  of  LPARS'  local  approach  which  has 
the  disadvantage  of  requiring  a  great  deal  of  initial  lexical  processing.  The  entire  vocabulary  is 
matched  against  the  entire  input.  If  v  ords  had  been  recognized  with  sufficient  accuracy  in  the 
early  part  of  the  sentence,  some  of  this  lexical  work  might  not  really  be  necessary. 

The  most  obvious  advantage  of  the  local  approach  over  a  more  left-to- right  approach  is  that 
the  left-to-right  approach  would  have  difficulty  if  a  relatively  large  amount  of  error  occurred 
near  the  beginning  of  the  sentence.  It  would  be  unable  to  get  at  the  necessary  context  to  help  it 
deal  with  this  ambiguity,  to  isolate  it,  and  then  make  specific  suggestions  of  words  to  look  for. 

A  local  approach,  on  the  other  hand,  allows  information  on  both  sides  of  an  erroneous  area  to 
be  brought  to  the  assistance  of  the  parser. 

There  is  another  way  of  stating  this  argument.  Namely,  the  local  approach  allows  the  highest 
phonetic  distance  scans  to  be  made  taking  advantage  of  the  maximum  possible  amount  of  lower 
phonetic  distance  (well  matching)  information  available.  Thus,  the  higher-distance  scans  are 
"data  directed"  as  completely  as  possible.  (This  is  not  true  of  a  left-to-right  approach.) 

In  particular,  the  algorithm  that  connects  PPTs  has  available  to  it  all  possible  information 
from  previous  lower-level  parsing  and  processing.  As  a  result,  all  partial  parse  structures 
constructed  from  lower  phonetic  distance  processing  can  be  examined,  and  the  most  promising 
connections  can  be  tried  first.  This  is  a  very  powerful  capability,  and  one  which  no  left-to-right 
algorithm  can  achieve. 

In  a  left-to-right  system,  it  would  be  much  more  difficult  for  the  system  to  know  wnen  a  high- 
distance  match  might  be  desirable,  especially  if  one  wants  to  do  such  high-distance  matching 
quite  selectively.  It  is  certainly  impossible  to  get  information  as  to  the  right-hand  boundary  for 
sue  a  match. 

Syntactic  Economy 

A  second  issue  might  be  called  "syntactic  economy."  Syntactic  conciseness  or  economy  is 
a  vague  term,  especially  in  reference  to  natural  language  parsing  where  there  are  many  different 
formalisms  for  defining  syntactic  relationships:  transformations  with  side  conditions,  transition 
nets  augmented  by  tests  on  arcs,  PROGRAMMAR  procedures,  etc.  Syntactic  conciseness  is  usu¬ 
ally  a  qualitative  judgment  made  subjectively  by  looking  at  a  grammatical  representation  of  a  let 
of  linguistic  fac's. 

An  interesting  aspect  of  local  parsing  as  described  here  is  that  it  provides  motivation  for 
talking  explicitly  about  a  particular  type  of  syntactic  economy:  namely,  the  degree  to  which  the 
parsing  logic  for  similar  structures  is  collapsed  down  to  a  minimum  syntactic  skeleton. 
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In  the  LPARS  local  parser,  there  is  very  real  incentive  to  try  to  collapse  the  grammatical 
skeleton  as  much  as  possible,  since  every  time  a  local  syntactic  structure  is  recognized,  i*  is 
asserted  for  every  instance  that  occurs  in  the  grammar.  Therefore,  if  the  grammar  contains 
thirty  oeparate  instances  where  a  nou1'- phrase  can  be  followed  by  a  verb,  when  such  a  pattern 
is  recognized,  all  thirty  p.  isibilities  must  be  constructed,  even  though  iriry  of  them  might  never 
even  be  considered  in  a  left- to- right  system  analyzing  a  text  version  of  the  sentence. 

Such  "ineconomy"  would  lead  to  excessive  inefficiency  >n  the  construction  of  local  PPTs  and 
would  also  complicate  the  job  of  connecting  different  segments  since  there  would  be  so  many  to 
choose  from. 

LPARS  achieves  conciseness  by  using  a  transition  net  grammar  augmented  by  syntactic  fea¬ 
tures.  The  transition  net  itself  is  a  very  powerful  tool  for  achieving  syntactic  conciseness  since 
it  can  capture  very  naturally  the  regularities  of  many  different  permutations  of  a  basic  structure. 
BNF,  for  example,  does  not  have  this  capability  because  it  provides  no  mechanism  similar  to  the 
transition  net's  ability  to  let  pa- sing  paths  diverge  and  later  reconverge. 

To  achieve  syntactic  compactness  in  a  systematic  fashion,  th -  transition  net  is  married  in 
LPARS  to  a  3ystem  of  syntactic  features  which  allow  a  number  of  different  permutations  of  a 
sentence  (such  as  relative  clauses,  declarative  sentences,  questions,  etc.)  to  be  consolidated 
into  a  single  transition  net  structure.  Thus,  an  efficient  syntactic  skeleton  can  be  created  for 
local  parsing. 

3.  Local  Representation  of  Global  Relationships 

An  interesting  problem  inherent  in  the  local  approach  is  that  of  lo-ally  representing  different 
possible  global  relationships  between  adjacent  syntactic  elements.  This  problem  arises  because 
the  global  relationship  between  adjacent  words  often  cannot  be  determined  from  limited  context. 

LPARS  handles  this  problem  in  three  ways: 

(a)  One  mechanism  used  is  the  ancestor  link  described  in  S'-c.  V-B.  Ancestor  links 
are  used  to  represent  potential  ambiguity  caused  by  right  i  ;cursion  in  the  grammar. 

(b)  LPARS  also  takes  advantage  of  the  syntactic  economy  acnieved  by  collapsing  a 
number  of  different  syntactic  patterns  down  to  a  single  sj  .itactic  skeleton.  Thus, 
a  number  of  potential  global  relationships  can  be  combined  in  a  single  local 
structure. 

(c)  The  third  mechanism  is  the  fallback  of  constructing  several  differen  parse  trees 
if  different  parses  of  some  set  of  words  cannot  be  convenicn‘ly  merged  by  the  two 
techniques  just  described.  This  fallback,  although  less  desirable,  is  not  too  ineffi¬ 
cient  if  it  is  only  necessary  on  a  limited  basis. 

4.  Formalism  vs  Procedures  in  Parsers 

A  very  interesting  issue  brought  to  light  concerns  the  relative  merits  of  expressing  logic 
in  the  form  of  a  nonprocedural  formalism  (such  as  a  transition  net  grammar),  or  in  the  form  of 
user-  written  procedures  (such  as  LPARS1  LOCALMATCH  routine). 

A  number  of  speculations  made  in  the  present  work  indicate  that  the  type  of  probabilistic, 
semantically  oriented  logic,  which  might  be  used  in  the  LOCALMATCH  routine  to  search  for 
local  structures  that  descr  ibe  objects  and  events  in  a  scere,  might  be  written  most  conveniently 
in  the  form  of  procedures,  rather  than  written  to  operate  from  an  arbitrary  transition  net.  Much 


of  this  logic  could  be  quite  divorced  from  the  syntactic  representation  of  the  phrase  involved 
(Remember  that  the  goal  of  the  LOCALMATCH  routine  is  not  to  perform  a  systematic  search  for 
a  globally  consistent  sentence,  but  ra.her  the  use  of  heuristics  to  find  plausible  structures  in 
different  parts  of  the  sentence.) 

Once  these  local  structures  are  found,  however,  the  system  needs  to  have  a  means  of  assur¬ 
ing  overall  consistency  and  global  compatibility  of  the  entire  utterance  and  to  direct  attention  to 
areas  which  might  be  investigated  to  attain  this  global  consistency.  The  transition  net  formalism 
is  used  to  achieve  this.  It  would  be  much  more  difficult  to  write  the  logic  that  directs  this  proc¬ 
essing  (the  CONNECT  algorithm)  in  purely  procedural  form,  without  som  structural  skeleton 
such  as  a  transition  net  to  work  from.  This  would  be  difficult  beet,  se  the  logic  that  pieces  the 
entire  sentence  together  must  be  able  to  start  processing  at  any  place  in  the  sentence,  and  must 
be  able  to  join  together  any  two  arbitrary  sentence  fragments  by  proposing  words  between  them. 

A  transition  net  grammar  provides  a  very  natural  framework  to  structure  this  logic  around.  It 
is  hard  to  imagine  how  one  uld  write  such  logic  v.’ithout  some  such  abstract,  manipulab’e,  non¬ 
procedural  representation  of  the  grammar. 

Thus,  procedurer  seem  to  be  the  best  medium  (in  tne  current  state  of  the  art)  for  express¬ 
ing  the  logic  that  uses  semantic  and  pragmatic  information  to  help  100k  foi  local  structures, 
whereas  the  transition  net  formalism  is  best  suited  for  systematically  trying  to  enforce  global 
structural  consistency. 

5.  Implications  for  Hardware  Design 

It  is  interesting  to  consider  what  the  design  of  the  J.PARS  system  might  imply  ahout  the  de¬ 
sign  of  efficient  hardware  for  continuous  speech  processing.  One  of  the  most  striking  aspects 
of  the  LPARS  system  is  the  large  degree  of  initial  lexical  processing  done  in  the  initial  scan  and 
the  l.OCAI.MATCH  routine. 

The  lexical  ■  ^.ching  involves  a  great  deal  of  computation,  even  though  it  is  done  at  a  Ica 
phonetic  distance.  On  the  other  hand,  virtually  all  this  processing  can  be  done  in  parallel.  Thus, 
there  is  great  potential  for  performing  this  computation  very  quickly,  if  one  has  special- purpose 
hardware  which  allows  parallel  computation  on  a  massive  scale.  Such  a  system  might  contain 
many  special-purpose  mini-processors  especially  designed  to  perform  lexical  recognition.  Such 
hardware,  although  perhaps  not  entirely  feasible  at  present,  is  liable  to  become  perfectly  viaole 
in  the  near  future,  given  the  present  rate  at  which  costs  of  computational  hardware  are  dropping. 

6.  Parallel  Process  Independence  vs  Coordination 

It  is  worth  discussing  exactly  to  what  degree  LPARS  allows  parallel  independent  processing 
to  take  place  and  to  what  extent  processing  in  various  parts  of  the  utterance  is  coordinated. 

The  entire  processing  of  LPARS  is  globally  coordinated  in  the  sense  that  the  phonetic  dis¬ 
tance  thresholds  are  set  for  processing  in  all  parts  of  the  utterance,  in  a  globally  coordinated 
fashion. 

Within  this  overall  global  coordination  of  phonetic  distance  thresholds,  however,  the  proc¬ 
essing  in  the  initial  scan  consists  of  many  independent  lexical  matching  processes  which  are 
conceptually  independent.  Similarly,  the  LOCALMATCH  routine  consists  conceptually  of  a  num¬ 
ber  of  independent  processes  which  use  local  information  to  direct  their  actions. 

Once  this  initial  local  processing  ends,  more  global  processing  begins-  namclv,  the  construc¬ 
tion  of  PPTs  the  inspection  of  these  trees  from  all  parts  of  the  utterance,  and  the  attempt  to 
connect  them  together  bv  proposing  intervening  words,  using  the  most  promising  first.  The 
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nature  of  th.  later  processing  is  not  independent  or  potentially  parallel  since  it  requires  exami¬ 
nation  of  all  parse  structures  together  (to  see  which  is  most  promising  ind  which  input  segments 
are  worth  investigating  first). 


input 


partial 

parse 

trees  (global/ 


sequential) 

[connect  | 

* 

sentence 

candidates 

Fig.  VIII- 1. 

Thus,  in  LPARS,  the  distinction  between  local/independent  vs  global/sequential  processing 
is  clearly  marked.  This  distinction  separates  the  initial  scan/LOCALMATCH  processing  from 
the  analysis  done  by  the  TREE  and  CONNECT  algorithms  (see  Fig.  VIII- 1 ). 

P.  AREAS  FOR  FURTHER  WORK 

The  primary  focus  of  the  present  work  has  been  on  developing  a  local  approach  to  parsing 
which  allows  structures  to  be  built  up  in  all  parts  of  the  sentence,  so  that  the  system  can  explore 
(using  the  transition  network)  how  these  local  structures  might  be  pieced  together  in  a  globally 
consistent  fashion. 

1.  Elimination  of  Current  Restrictions 

There  are,  of  course,  a  number  of  restrictions  in  the  scope  of  the  present  work.  A  number 
of  potential  areas  for  further  work  are  extensions  of  the  system  which  remove  these  restrictions: 

(a)  Only  a  limited  syntax  is  used.  It  would  be  interesting  to  explore  the  degree  to  which 
the  problems  of  local  parsing  are  compounded  when  a  much  larger  syntax  is  used, 
and  to  explore  techniques  useful  in  dealing  with  these  problems. 

(b)  Only  a  relatively  small  vocabulary  is  used  by  LPAPS.  It  would  be  interesting  to 
explore  the  problems  that  arise  as  the  vocabulary  is  substantially  increased. 
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I e)  Tne  LPARS  system  uses  scrambled  input  rather  than  input  produced  by  a  real  front 
end.  An  obv:ous  extension  is  to  join  LPARS  to  a  working  front  cud,  at  which  point 
one  could  begin  to  deal  with  such  phenomena  as  coarticulation,  st  'erscgmental  ef¬ 
fects,  etc.  into  the  LPARS  lexical  matching  scheme. 

The  most  interesting  extensions  of  the  present  work  arc  those  to  the  semantic  component  of 
LPARS.  Semantic  extensions  could  be  incorporated  at  almost  all  levels  of  the  LPARS  system. 

In  general,  these  extensions  have  been  mentioned  as  appropriate  in  several  sections  of  this  re¬ 
port.  Such  proposed  extensions  generally  involve  using  the  context  of  a  scene  being  discussed 
and  of  an  interactiv  dialog  to  assist  in  the  processing.  Context  of  this  sort  might  assist  in  the 
following  wavs. 

(a)  Allow  the  system  to  do  its  processing  using  only  a  portion  of  its  entire  vocabulary 

(b)  Augment  the  LOCALMATCH  routine  to  look  for  local  structures  which  described 
objects  or  events  in  the  scene. 

(c )  Augment  the  action  routines  so  that  noun-phrases  and  clauses  are  recognized  only 
if  thev  make  sense  in  terms  of  the  scene  being  described. 

2.  Tradeoffs  in  a  Higher- Level  Speech  System 

As  discussed  in  Sec.  1,  high-level  systems  such  as  LPARS  con  make  an  important  comn'ju- 
tion  bv  providing  a  medium  in  which  one  can  begin  to  evaluate  var.ous  tradeoffs  involved  i.i  build¬ 
ing  an  integrated  speech  system  -  tradeoffs  involving  such  things  as: 

(a)  The  quality  of  front-end  information, 

(b)  The  size  of  the  vocabulary, 

(c)  The  complexity  of  the  syntax, 

(u )  The  sophistication  of  the  semantic  routines  available  to  assist  the  processing, 

(e)  The  frequency  of  occurrence  of  small  function  words  that  must  be  handled,  and 

if)  The  degree  to  which  the  vocabulary  can  be  partitioned. 

Once  a  concrete  feeling  for  such  tradeoffs  is  obtained,  one  can  begin  to  get  a  feeling  for 
where  research  effort  can  be  most  productively  allocated.  It  would  be  interesting  to  systemati¬ 
cally  explore  some  of  these  tradeoffs. 

3.  Integration  of  Supra-segmental  Information 

Another  very  interest*  ng  area  for  further  work  is  the  integration  of  the  use  of  supra-segmental 
information  into  a  >ocal  system  such  as  LPARS.  In  a  speech  waveform,  there  is  a  great  deal  of 
such  information  which  could  help  indicate  where  syllables  are,  where  stressed  morphs  are.  where 
phrase  boundaries  are,  and  possibly  even  information  about  the  overall  structure  of  the  sentence. 

Such  information  could  be  very  helpful  in  allowing  a  system  to  establish  "islands  of  relia¬ 
bility"  from  which  to  work  in  various  parts  of  a  sentence.  One  would  suspect,  therefore,  that 
such  information  could  be  utilized  particularly  effectively  in  locally  organized  systems  like 
LPARS.  Without  local  organization,  even  if  one  did  use  local  supra-segmental  information  one 
would  eventually  have  to  fall  back  on  a  left-to-right  approach. 

Hopefully,  the  development  of  locally  organized  parsing  systems  will  give  added  impetus  to 
research  into  promising  supra-segmental  effects,  since  such  parser  organization  would  provide 
a  natural  vehicle  for  exploiting  such  information  effectively. 


9  3 


appends;  a 

THE  PHONEME  SIMI1.ARITY  TABLE 


Appendix  A  descriaes  the  phonemes  used  in  the  LPARS  system,  and  the  phoneme  similarity 
table. 

I.  LPARS'  PHONEMES 

Table  A-l  lists  the  phoneme  set  used  in  our  work.  With  each  symbol  is  the  1PA  symbol  and, 
if  not  obvious,  a  sample  pronunciation. 


II.  PHONEME  SIMILARITY  TABLE 

1 9 

Table  A-E  is  a  reproduction  of  the  table  in  Appendix  10  of  the  ARPA  Speech  Report.  This 
phoneme  similarity  table  is  used  by  the  lexical  scrambler  and  word-recognition  algorithm  as 
described  in  Sec.  III. 
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TABLE  A-2 
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appendi.c  n 

LPARS  VOCABULARY  AND  TEST  SENTENCES 


Appendix  R  lists  the  words  in  LPARS'  vocabulary  with  their  parts  of  speech  and  phonetic 
spellings,  and  the  sentences  used  in  LPARS'  evaluation. 


1.  LPARS  VOCABULARY 

(DISLIKE  VERB  D  I  S  L  A  1  K) 

(ENJOY  VERB  SVV  N  J  O  1) 

(HANDLE  VERB  II  AE  N  n  ,,) 

(DISCARD  VERB  D  1  S  K  AR  b) 

(CLUTTER  VERB  K  L  SVV  T  R) 

(COMPLEMENT  VERB  K  A  M  P  L  SVV  M  SVV  N  T) 
(ILLUMINATE  VERB  I  L  CO  M  I  N  AA  T) 

(BRIGHTEN  ERB  R  R  A  I  T  N) 

(CONCEAL  VERB  C  SVV  N  S  EE  L) 

(CONTAIN  VERB  K  SVV  N  T  AA  N) 

(SUPPORT  VERB  S  SVV  PORT) 

(OBSCURE  VERB  A  B  S  K  I  OO  Rl 
(PLACE  VERB  P  L  AA  S) 

(APPROACH  VERB  A  P  R  O  CH) 

(CHRISTOPHER  PROPN  KRIST  SVV  F  R) 

(MARIANNE  PROPN  M  E  R  I  AE  N) 

(ROBERT  PROFN  R  SVV  B  SW  R  T) 

(PERIWINKLE  PROPN  P  SW  R  EE  W  INK!' 

(ANASTASIA  PROPN  E  N  E  S  T  AA  S  EE  SW  > 

(CHANDELIER  NO-  N  Sii  AE  N  D  SVV  l.  EE  R) 

(FLASH?. IGIiT  NOUN  F  L  AE  Sii  L  A  i  T) 

(TABLEI.AMP  i,OUN  T  AA  B  I.  AE  M  P) 

(FIREPLACE  NOUN  FAIilPLAASi 
(S1DETABLE  NOUN  S  A  I  D  T  AA  B  L) 

(COFFEETABLE  NOUN  K  AW  F  EE  T  Ah  B  L) 
(ARMCHAIR  NOUN  A  R  M  CH  AA  R) 

(BOOKCASE  NOUN  BUKAA  S) 

(IVADIATOR  NOUN  R  AA  D  EE  AA  T  O  R) 

(NEWSPAPER  NOUN  NOOSPAAPRl 
(MAGAZINE  NOUN  M  AE  C,  SW  Z  EE  N) 

(ASHTRAY  NOUN  A  SII  T  R  AA) 

(DICTIONARY  NOUN  D  I  K  SH  SVV  N  AA  R  EE) 
(REFRIGERATOR  NOUN  R  EE  F  R  I  D  J  SVV  R  AA  T  O  R) 
(CABINET  NOUN  K  E  B  1  N  SW  T) 

(PERSON  NOUN  P  SW  R  S  SW  N) 

(TELEVISION  NOUN  T  SW  L  SW  V  I  J  SVV  N) 
(BREAKFASTTA.JLE  NOT  N  B  R  SW  K  F  E  S  T  AA  B  L) 
(WASTEBASKET  NOUN  VV  AA  S  T  B  E  S  K  SW  T) 
(STEPLADDER  NOUN  S  T  SW  P  L  E  DD  SW  R) 
(FOOTSTOOL  NOUN  FUTSTOOL) 

(VACUUMCLEANER  NOUN  V  EKIOOM"LEEN  SW  II) 
(ENTRANCE  NOUN  SW  N  T  R  E  N  S) 

ONFRONTOF  PREP  I  N  F  R  SW  N  T  SW  V) 

(ONTOPOF  PREP  AW  N  T  A  P  SW  V) 

(BESIDE  PREP  B  EE  S  A  I  D) 

HNBACKOF  PREP  I  N  B  E  K  SW  V) 

(NEXTTO  PREP  N  SW  K  S  T  OO) 

(ONTHERIGHTOF  PREP  AW  N  TH  SW  R  A  I  T  SW  V) 
(ONTHELEFTOF  PREP  AW  N  TH  SW  L  SW  F  T  SVV  V) 
(BEHIND  PREP  B  EE  II  A  I  N  D) 

(RECTANGULAR  AD.I  R  SW  K  T  E  N  G  I  OO  L  A  R) 
(CIRCULAR  ADJ  S  I  R  K  I  OO  L  A  R) 

(BROWN  ADJ  B  R  A  OO  N) 
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(ORANGE  ADJ  ORAN  J| 

(GREEN  ADJ  G  R  EE  N) 

(FRIENDLY  ADJ  F  R  SW  N  D  L  EE) 

(DOES  DODl'ZI 
(WHO  WH  H  OO) 

(WHICH  WH  W  I  CH) 

(WHOM  WH  H  OO  M) 

(WHAT  WH  W  A  T) 

(IS  BE  1  Z) 

(THE  DET  TH  SW) 

(BY  BBY  BA1) 

(S  AFXS  Z) 

(D  AFXS  T) 

(O  AFXS  ) 

(IN  PREPA  1  N) 

(ON  PREPA  AW  N) 


II.  SENTENCES  USED  IN  LPARS'  EVALUATION 


(♦he  dictionary  infrontof  the  sidetable  is  place  d  in  ths 
wastebasket  by  periwinkle) 

(the  wastebasket  which  the  green  coffeetable  support  s 
contain  s  the  dictionary) 

(the  coffeetable  which  support  s  the  green  ashtray  obscure  s 
the  entrance ) 

(the  green  coffeetable  support  s  the  dictionary) 

(the  sidetable  which  support  3  the  ashtray  obscure  s  the 
fireplace ) 

(the  sidetable  which  support  s  the  ashtray  in  the  bookcase 
obscure  s  the  fireplace) 

(does  the  sidetable  support  the  ashtray) 

(what  does  the  sidetable  support ) 

(the  dictionary  on  the  sidetable  is  brighten  d  by  the 
chandelier) 

(does  the  chandelier  infrontof  the  fireplace  brighten  the 
rectangular  coffeetable) 

(the  chandelier  infrontof  the  fireplace  brighten  s  the 
coffeetable) 

(what  does  the  chandelier  infrontof  the  fireplace 
brighten) 

(the  coffeetable  nextto  the  bookcase  is  illuminate  d  by 
the  tablelamp) 

(is  the  dictionary  support  d  by  the  sidetabie) 

(does  periwinkle  place  the  dictionary  on  the  coffeetable) 

(what  does  periwinkle  place  beside  the  refrigerator) 

(what  support  s  the  dictionary  ontherightof  the  fireplace) 
(place  the  circular  ashtray  in  the  wastebasket) 

(the  dictionary  which  is  support  d  by  the  coffeetable 
support  s  the  tablelamp  which  illuminate  s  the  fireplace) 
(who  dislike  s  periwinkle) 

(the  friendly  person  place  s  toe  ashtray  inbackof  the 
cabinet) 

(periwinkle  place  s  the  dictionary  on  the  sidetable  beside 
the  bookcase) 

(place  the  green  dictionary  on  the  bookcase) 

(the  coffeetable  beside  the  armchair  infrontof  the 
fireplace  support  s  the  ashtray) 

(the  tablelamp  beside  which  robert  place  s  the  dictionary 
illuminate  s  the  armchair) 
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(the  refrigerator  in  which  the  ashtray  is  place  d  by 
robert  conceal  s  the  entrance) 

(whom  does  marianne  dislike) 

(on  what  does  cnristopher  place  the  dictionary) 

(beside  what  does  Christopher  place  the  ashtray) 

(beside  what  is  the  green  rectangular  ashtray  place  d  by 
robert) 

(is  robert  dislike  d  b  the  friendly  person  beside  the 
coffeetable) 

(what  is  support  d  by  the  circular  sidetable) 

(what  is  place  d  beside  the  newspaper  on  the  coffeetable 
by  Christopher) 

(on  what  docs  periwinkle  place  the  tablelamp) 

(beside  what  does  the  friendly  person  place  the 
magazine) 

(infrontof  what  is  the  magazine  place  d  by  marianne) 

(in  what  is  the  newspaper  place  d  by  robert) 

(the  bookcase  contain  s  the  flashlight) 

(the  flashlight  illuminate  s  the  circular  wastebasket 
beside  the  television) 

(the  magazine  is  support  d  by  t'.ie  dictionary  ontopof  the 
coffeetable ) 

(the  newspaper  on  the  sidetable  conceal  s  the  ashtray) 
(the  brown  sidetable  beside  the  fireplace  conceal  s  the 
armchair) 

(place  the  green  dictionary  beside  the  refrigerator) 
(place  the  dictionary  which  support  s  the  ashtray  beside 
the  cabinet) 

(does  the  newspaper  conceal  the  circular  ashtray) 

(does  Christopher  enjoy  marianne) 

(what  does  the  tablelamp  infrontof  the  fireplace 
illuminate) 

(place  the  ashtray  infrontof  the  sidetable) 

(what  brighten  s  the  breakfasttable ) 

(the  bookcase  behind  the  coffeetable  support  s  the 
magazine) 
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APPENDIX  C 

WORD- RECOGNITION  ALGORITHM 


Appendix  C  describes  the  routine  used  for  lexical  recognition.  This  routine  receives  the 
following  arguments 


WORD 

SENT 

I 

wi;  TH 

SLNTK 

WDIST 


the  correct  phonetic  spelling  of  the  word  to  be 
matched  (an  array  of  characters). 

the  i  i.onetic  spelling  of  the  scrambled  input 
utterance  an  array  of  characters). 

the  phoneme  position  in  the  sentence  at  which 
the  match  is  to  be  started. 

the  length  of  the  word, 
the  length  of  the  sentence. 

the  maximum  distance  permissible  for  the  match. 


The  algc.  thm  calls  the  following  routines: 

SUBDIST(PHONl,  PHON’2)  returns  the  phonetic  distance  of  substitution  between  two 
phonemes. 

JELDlST(P!ION,  LASTPHON)  returns  the  phonetic  distance  of  deletion  of  PHON 
when  preceded  by  LASTPHON. 

^TACKfJ,  K.  DIST)  puts  three  values  onto  a  stack. 
rfNSTACK(  I,  K.  DIST)  pop3  the  three  values  from  the  stack. 

The  algorithm  explores  all  possibilities  of  substitution  and  deletion  in  matching  the  word  to 
the  section  of  input  which  starts  at  position  I. 
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LEX(WORD,SENT,l,WLNTH,SLNTH,WDIST; 


START 
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APPENDIX  D 

TREE  AND  CONNECT  ALGORITHMS 

Appendix  D  contains  flowcharts  of  the  logic  of  the  TREE  and  CONNECT  algorithms.  In 
creating  these  flowcharts,  we  have  tried  to  leave  in  enough  detail  to  convey  the  basic  operations 
being  performed,  but  not  no  much  as  to  obscure  them.  The  algorithms  described  in  this  Appendix 
are  written  to  operate  on  LISP  data  structures  flints);  therefore,  they  use  the  following  LISP 
functions : 

CAR,  which  obtains  the  first  member  of  a  list, 

CDR,  which  obtains  the  remainder  of  the  list, 

LIST,  which  returns  a  list  of  the  elements  passed  to  it.  and 
APPEND,  which  appends  two  lists. 

The  algorithms  also  invoke  the  following  functions  (listed  alphabetically): 

ADDSON( PPT, SON,  ARC)  returns  a  new  PPT  created  by  adding  a  rightmost  son 
to  the  input  PPT  (providing  that  the  action  routine  associated  with  the  ARC  permits, 
else  NIL). 

ANCESTOR(PPT)  returns  true  if  the  leftmost  son  of  the  top  node  of  the  PP1'  is  an 
ancestor  link  son. 

CATPATH(LPT,  RPT)  returns  a  new  PPT  consisting  of  r.  concatenation  of  the  two 
input  PPTs  (providing  that  the  action  routines  are  compatible,  else  NIL).  If  the 
leftmost  son  of  the  RPT  is  an  ancestor  link  son,  it  is  not  included  in  the  resulting 
structure. 

CONTENTW(ARC)  returns  true  if  the  ARC  is  labeled  bv  a  content  word  terminal. 
CONTEXTLIST(PPT)  returns  a  list  of  all  arcs  which  invoke  the  PPT. 

E NDSTA T E ( P PT )  returns  the  end  state  of  the  top  node  of  the  PPT. 

ENDTEST(ENDFI  AG,  PPT)  returns  true  if  ENDFLAG  is  NIL  or  if  the  PPT  is  a 
final  PPT. 

EXITAACSfPPT)  returns  a  list  of  all  arcs  emanating  from  the  end  state  of  the  top 
node  of  the  PPT. 

FINAL)  PPT)  returns  true  if  PPT  ends  on  a  permitted  final  state  or  is  a  terminal 
node. 

I  IRSTARC(PPT)  returns  the  first  arc  in  the  top  node  of  the  PPT. 

INITLAL(PPT)  returns  true  if  the  PPT  begins  on  a  permitted  initial  state. 

INITSTATES(ARC)  accepts  as  input  an  arc  invoking  a  nonterminal,  and  returns  a 
list  of  initial  states  of  that  nonterminal's  net. 

LAMBDA(ARC')  returns  true  if  the  ARC  is  a  LAMBDA  arc. 

LASTARC(PPT)  returns  the  righ*most  arc  in  the  top  node  of  the  PPT. 

LSON(PPT)  returns  the  leftmost  son  of  the  PPT,  or  NIL  if  the  PPT  is  a  terminal 
node. 
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NT(aRC)  returns  true  if  the  ARC  invokes  a  nonterminal. 

POPPPT(PPT,  ARC)  returns  a  new  PPT,  constructed  by  popping  with  the  PPT  to 
an  ARC  invoking  the  PPT  in  a  higher-level  net. 

PPT(ARC)  accepts  a  terminal  arc  and  constructs  a  "hypothetical"  terminal  PPT  of 
the  san.e  part  of  speech  as  the  arc. 

PTSW f.'.  if  N  is  an  arc  or  a  PPT,  this  function  returns  the  part  of  speech  asso¬ 
ciated  with  'hat  i  rc  of  PPT. 

RIrPIR5T3()N(PPT/  returns  ML  if  the  top  node  of  PPT  has  only  one  son,  or  re¬ 
turns  a  new  PPT  identical  to  the  input  PPT  but  minus  the  leftmost  son  of  the  top 
node. 

RIPLASTSON(PPT)  is  similar  to  R1PK1RSTSON  but  removes  the  rightmost  son. 

RSON(PPT)  returno  the  rightmost  son  of  the  PPT,  or  ML  if  the  PPT  is  a  te.-minal 
node. 

STARTPATTKP,  ARC)  returns  a  new  PPT  created  by  starting  a  path  in  a  net  (w'here 
ARC  emanates  from  an  initial  state  of  t‘”'t  net).  P  becomes  the  only  son  in  this 
new  PPT. 

STARTSTATE(PPT)  returns  the  starting  state  of  the  top  node  of  the  PPT. 
STORE(PPT)  storet  a  PPT  in  the  pho  .erne  position  {able. 

STORE LlS‘r<PPTL)  store'  a  list  of  PPTs  in  the  phoneme  position  table. 

TERMT(PPT)  returns  true  if  the  PPT  is  a  terminal. 

i.  1  REE  ALGORITHM 

The  basic  top-level  routine  of  the  TREE  algorithm  is  the  TREE-WORD  routine  which  takes 
as  input  a  PPT  and  a  word  and  constructs  as  many  possible  resulting  PPTs  as  possible  by  'append¬ 
ing,  pushing,  and  popping. 

The  HOOK  routine  coordinates  all  appending  and  pushing  connections  beiween  the  top  node 
of  a  PPT  and  the  word.  If  a  given  connection  is  not  possible,  the  routine  must  try  to  effect  the 
connection  to  a  rightmost  sub-part  of  that  node. 

The  PI  SH  routine  h  .  "es  pushing  connections. 

The  HOOKAl.L  routine  attempts  to  "hook"  the  word,  not  only  to  the  top  r.  <de  of  the  PPT, 
but  also  to  lower  rightmost  nodes.  Also,  it  checks  to  see  if  lower  rightmost  lescendents  of  the 
PPT  can  be  joined  to  the  word  in  other  syntactic  contexts 

The  POP  routine  handles  popping  connections.  In  the  process,  it  calls  the  ABORT  routine 
which  prevents  infinite  looping,  and  the  POPPPT  routine  which  must  alao  handle  creating  ances¬ 
tor  link  structures. 
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«VORD(PPT,WORD) 


START 


-<^7|NAL(PPT)?^> 

Y 


C=CONTEXTLIST(PPT) 

WHILE  C  ft  NIL  DO 
|  POP(PPT,WORD,CAR(C),NIL) 
C=CDR(C)  1 


STORtLIST(HOOKALL(PPT,  WORD)) 

Y~ 

DONE 


HOOK(PPT.WORD) 


START 


TRY(PPT(  SON,  ARC) 


START 


PUSH(ARC,  WORD) 


START 
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HOOKALL(PPT,WORD) 


/FINAL(RSON(PPT))?, 


Ti=RSON(PPT)]— 


RES=HOOK(PPT,WORDn 


FINAL(TI)? 


C=CONTEXTLIST(TI) 


C=NIL?: 

“ncz : 


TI=NIL? 


C=CDR(C)  j-^<C=LASTARC(PPT)?] 


jPOP(T1,  WORD,  ARC,  NIL) 1 


T2-HOOKALL(Tl,WORD) 

T3=RlPLASTSON(PPT) 


WHILE  T2  j=  NIL  DO 
[  RES=APPFND(RE$, 

ADDSON(T3,CAR(T?),LASTARC(PPT) )) 
T2=CDR(T2)  ] 


WHILE  T2  yt  NIL  DO 

|  RES=APPEND(RES,TRY(T3,CAR(T2),LASTARC(PPT)) ) 
T2<DR(T2)  | 


POP(PPT,  WORD,  ARC,  ARCL1ST) 


>BORT(PPT,ARC,ARCLIST)?, 

N 

T  l=POPPPT(PPT,  ARC) 
T2=HOOK(TI  ,WORD) 
STORELIST(T2) 


|  RETURN] 


<FI  NAL(T  I )  ry— ^  RE  TURN  | 


<~=CONTEXTLIST(TI) 

WHILE  C  /  NIL  DO 

[  POP(TI  ,WORD,CAR(C),  APPEND(ARCLIST,  LIST(ARC))) 
C=CDR(C)  ) 
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II.  CONNECT  ALGORITHM 


The  following  flowcharts  outline  the  two  main  routines  of  the  JOIN  sub-part  of  the  CONNECT 
algorithm. 

The  EXTEND  routine  coordinates  the  extending  of  a  path  from  an  LPPT  and  the  joining  of 
that  path  to  an  RPPT, 

The  PUSHEX  routine  is  called  with  1NTFLAG=N1L  when  the  extended  path  is  to  continue  on  a 
lower  level,  and  is  called  with  INTFLAG=TRUE  to  make  anRPFT  start  on  a  permitted  initial  state. 

The  DOREST  routine  is  called  by  the  EXTEND  routine  after  the  paths  have  been  joined. 

It  assures  that  the  rightmost  sons  of  the  LPPT  end  on  permitted  final  states,  and  call  the 
CONSTRAIN  routine. 

Three  arguments  are  used  for  communication  between  these  routines: 

CONTFLAG  is  true  when  an  extended  path  has  already  traversed  a  content  word, 
ENDFLAG  is  true  if  the  extended  path  must  end  on  a  final  state,  and 
1NTFL4G  is  used  in  the  PUSHEX  routine  as  described  above. 
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EXTENDS,  RPT,ENDFLAG,CONTFLAG,INTFLAG) 


START 


107 


PUSHEX(RPT,  AR,  ENDFLAG,  CONTFL  AG,  I NTFLAG) 


START 
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APPENDIX  E 
LOCALMATC11  LOGIC 


Appendix  E  describes  thehigher  phonetic  distance  lexical  matching  done  bv  the  LOCALMATC1I 
routine.  This  matching  is  done  in  areas  adjacent  to  and  between  words  found  either  bv  the  initial 
scan  or  bv  previous  local  matching. 

Three  types  of  matching  are  done: 

(1)  Matching  for  words  immediately  adjacent  to  a  word  already  found. 

(2)  Matching  for  nearby  words  separated  by  only  a  few  phonemes  from  a  word  already 
found. 

(3)  Matching  for  function  word-strings  in  the  area  between  two  words  already  found. 

1.  ADJACENT  MATCHING 

Below,  we  indicate  for  each  word  class  which  words  are  searched  for  immediately  adjacent 
on  each  side: 


NOUN 

left  -  adj 
right  -  verb,  prep 

VERB 

left  -  noun,  propn 

ADJ 

left  -  adj 
right  -  adj,  noun 

PROPN 

right  -  verb 

PREP 

left  -  noun 

ENDCH 

left  -  ncuu,  propn,  verb 

11.  NEARBY  MATCHING 

Below,  we  indicate  for  each  ■word  class  the  words  which  are  searched  for  in  areas  separated 
by  only  a  few  phonemes  on  each  side: 


NOUN 

left  -  verb. 

prep 

VERB 

left  -  verb 

right  -  noun. 

adj 

ADJ 

left  -  verb. 

prep 

PREP 

right  -  noun 

111.  FUNCTION  WORD- STRINGS  BETWEEN  WORDS 

Below,  we  show  the  set  of  function  word-strings,  and  the  word-pair  contexts  between  which 
each  is  matched  for.  (These  tests  are  made  only  if  the  two  words  are  separated  by  a  roughly  ap¬ 
propriate  number  of  phonemes.) 
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the 


in  the 


strch  adj/noun 

(i.e.,  between  a  strch  and 
an  adj  or  a  noun) 
prep  adj/noun 
verb  adj/noun 

is 

strch  propn 
which 

prep  propn 
noun  propn 
noun  verb 


verb  adj/noun 
noun  adj/noun 

which  the 

noun  adj/noun 
prep  adj/noun 

whom  the 

noun  adj/noun 

is  the 

strch  adj/noun 
does  the 


by 


strch  adj/noun 


noun  propn 
verb  propn 

does 

strch  propn 
strch- prep  propn 

(i.e.,  between  a  prep,  at 
the  start  of  th«.  sentence, 
and  a  propr.j 


in  which  the 

noun  adj/r.oun 
on  which  the 

noun  adj/noun 
what  does  the 

strch  adj/noun 
streh-prep  adj/noun 


which  is 


what  is  the 


noun  verb 
what  does 

strch  propn 
streh-prep  propn 

whom  does 

strch  propn 
on  what  does 
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